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CANONICAL TREES, 

COMPACT PREFIX-FREE CODES AND 
SUMS OF UNIT FRACTIONS: 

A PROBABILISTIC ANALYSIS 

CLEMENS HEUBERGER, DANIEL KRENN, AND STEPHAN WAGNER 

Abstract. Eor fixed t > 2, we consider the class of representations of 1 as sum of unit 
fractions whose denominators are powers of t or equivalently the class of canonical compact 
t-ary Huffman codes or equivalently rooted t-ary plane “canonical” trees. 

We study the probabilistic behaviour of the height (limit distribution is shown to be 
normal), the number of distinct summands (normal distribution), the path length (normal 
distribution), the width (main term of the expectation and concentration property) and the 
number of leaves at maximum distance from the root (discrete distribution). 


1. Introduction 


We consider three combinatorial classes, which all turn out to be equivalent: partitions 
of 1 into powers of t, canonical compact t-ary Huffman codes, and “canonical” t-ary trees, 
see the precise discussion below. In this paper, we are interested in the structure of these 
objects under a uniform random model, and we study the distribution of various structural 
parameters, for which we obtain rather precise limit theorems. Let us hrst define all three 
classes precisely and explain the connections between them. Throughout the paper, t >2 will 
be a fixed positive integer. Figure o shows examples in the case t = 2. 

(1) Partitions of 1 into powers of t (representations of 1 as sum of unit fractions whose 
denominators are powers of t) are formally defined as follows: 


T ^ 

C Partition — ■ j ^t) G ^ T ^ 0, 0 ^ Xi ^ X2 ^ ^ Tt, ^ — l|'. 


i=l 


The external size |(xi,..., a:.r)| of such a representation (xi,..., Xr) is dehned to be 
the number r of summands. 

(2) Secondly, we consider canonical compact t-ary Huffman codes: 


C-code = {C C {1,..., t}* I C is prefix-free, compact and canonical}. 


Here, we use the following notions. 

• {!,..., t}* denotes the set of finite words over the alphabet t}. 

• A code C is said to be prefix-free if no word in C is a proper prefix of any other 
word in C. 


2010 Mathematics Subject Classification. 60G05, 05A16. 

Key words and phrases. Ganonical Gary trees, compact prefix-free codes, unit fractions, limit theorems. 

G. Heuberger and D. Krenn are supported by the Austrian Science Fund (FWF): W1230, Doctoral Program 
“Discrete Mathematics” and the Austrian Science Fund (FWF): P24644-N26. 

S. Wagner is supported by the National Research Foundation of South Africa, grant number 70560. 

An extended abstract with less general results without proofs appeared as m- 

1 




2 


CLEMENS HEUBERGER, DANIEL KRENN, AND STEPHAN WAGNER 


• A code C is said to be compact if the following property holds: if tc is a proper 
prefix of a word in C, then for every letter a G {1,... ,t}, wa is a prefix of a word 
in C. 

• A code C is said to be canonical if the lexicographic ordering of its words 
corresponds to a non-decreasing ordering of the word lengths. This condition 
corresponds to taking equivalence classes with respect to permutations of the 
alphabet (at each position in the words). 

The external size \C\ of a code C is defined to be the cardinality of C. 

If C e Ccode with C = {rci,..., Wr} and the property that length(uij) < length(rci+i) 
holds for all i, then (length(tci),..., length(t(;T-)) £ CpartiUon- This is a bijection between 
Ccode and CPartition preserving the external size. This connection can be explained 
by the Kraft-McMillan inequality EniE2], which states that for any prefix-free code 
C = {tci,..., Wr}, one must have 


T 

^ ^-length(«)i) ^ 2 
i=l 


and compact codes are precisely those for which equality holds (meaning that they are 
optimal in an information-theoretic sense). 

(3) Finally, both partitions and codes are related to so-called canonical rooted t-ary trees: 


CTree = {T rooted t-ary plane tree | T is canonical}. 


Here, we use the following notions. 

• t-ary means that each vertex has no or t children. 

• Plane tree means that an ordering “from left to right” of the children of each 
vertex is specified. 

• Canonical means that the following holds for all k: if the vertices of depth 
(i.e., distance to the root) k are denoted by ui, ..., vk from left to right, then 
deg(ui) < deg(uj+i) holds for all i. 

The external size \T\ of a tree is given by the number of its leaves, i.e., the number of 
vertices of degree 1. 

If C G Ccode, then a tree T G Cpree can be constructed such that the vertices of T 
are given by the prefixes of the words in C, the root is the vertex corresponding to the 
empty word, and the children of a proper prefix w oi a code word are given from left 
to right by wa for o = 1, ..., t. This is a bijection between Ccode to Cpree preserving 
the external size. 

Further formulations, details and remarks can be found in the recent paper of Elsholtz, 
Heuberger and Prodinger m- We will simply speak of an element in the class C when the 
particular interpretation as an element of CPartition, Ccode or Cpree is not relevant. Our proofs 
will use the tree model, therefore Cpree is abbreviated as T. 

The external size of an element in C is always congruent to 1 modulo t — 1. This can easily 
be seen in the tree model, where the number of leaves r and the number of internal vertices n 
are connected by the identity 

T = 1 -|- n{t — 1). 

Therefore, we will from now on consider the internal size: for a tree T G CjYee the internal 
size of T is the number n{T) of internal vertices, for a code C G Ccode the internal size is 
the number of proper prehxes of words of C, and for a partition (xi,..., Xr) G CpartiUon the 
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Figure 1.1. All elements of external size 5 (and internal size 4, respectively) 

in F Tree ; ^ Code and (^Partition foL ^ — 2. 


internal size is defined to be (r — l)/{t — 1). We will omit the word “internal” and will always 
use the variable n (or n{T) for a specific element T £ C) to denote the size. 

The asymptotics of the number of elements in C of size n has been studied by various 
authors, see the historical overview in m- Special cases and weaker versions (without explicit 
error terms) of the following result, which is given in [11] (building upon the generating 
function approach by Flajolet and Prodinger M), were obtained earlier and independently 
by different authors (Boyd [5], Komlos, Moser and Nemetz [19| . Flajolet and Prodinger [14] 
and Tangora pH]l. 

Theorem I f[TT|l. For t >2, the number of elements of size n in C is (in Bachmann-Landau 
notation) given by 

Pp"+i + 0(p^), 

where p > P 2 and R are positive real constants depending on t with asymptotic expansions (as 
t ^ oo) 


p = 2- 


1 



/C'2 = 1 + 


log 2 
t 


+ 0 





t-2 



In fact, all O-constants can be made explicit and more terms of the asymptotic expansions 
in t of p, p 2 and R can be given. 

In spite of the fact that the counting problem has been studied independently by many 
different authors, to the best of our knowledge the structure of random elements has not been 
considered before. Thus the purpose of this contribution is to study the probabilistic behaviour 
of various parameters of a random element in C of size n. We always use the uniform random 
model: whenever a random tree (equivalently, partition or code) of a given order n is chosen, 
all elements are considered to be equally likely. 

(1) The height h{T) of a tree T £ Cpree is defined to be the maximum distance of a leaf 
from the root. In the interpretation as a code, this is the maximum length of a code 
word. In a representation of 1 as a sum of unit fractions, this corresponds to the largest 
denominator used (more precisely, to the largest exponent of the denominator). 
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The height is discussed in Section It is asymptotically normally distributed with 
mean ~ /i/jU and variance ~ where 




1 t-2 


2t+3 




and 


^h-^ + 


1 -f + 5t-2 


2t+4 




cf. Theorem m Moreover, we prove a local limit theorem. 

(2) The number of distinct summands of a representation (xi,... ,Xt) of 1 as sum of 
unit fractions is denoted by d{xi,... ,Xt)- In the tree model, this corresponds to the 
cardinality d{T) of the set of depths of leaves in a tree T G Cxree- III the code model, 
this is the number of distinct lengths of code words. 

The number d{T) is studied in Section It is asymptotically normally distributed 
with mean ~ and variance ~ where 


2 2^+3 



and 


2 1 -t^ + 9t-14 ^ r 


cf. Theorem IV Moreover, a local limit theorem is proved again. 

(3) The maximum number of equal summands of a representation (xi,..., Xt) of 1 as sum 
of unit fractions is denoted by w{xi ,..., Xr). In the code model, this is the maximum 
number of code words of equal length. In the tree model, this is the “leaf-width” w{T), 
i.e., the maximum number of leaves on the same level. 

The number w{T) is studied in Section]^ We prove that E(u)(T)) = //^logn -f 
O(loglogn) with = l/(tlog2) -|- 0(l/t^) and a concentration property, cf. Theo¬ 
rem 0 

(4) The (total) path length £{T) of a tree T G CTree is defined to be the sum of the depths 
of all vertices of the tree. In our context, it is perhaps most natural to consider the 
external path length lexternai{T)i though, which is the sum of depths over all leaves of 
the tree, as this parameter corresponds to the sum of lengths of code words in a code 
C G Ccode- Likewise, the internal path length iintemai{T) is the sum of depths over all 
non-leaves. Clearly, we have (-extemaiiT) + ^internai{T) = ^{T)i and the relations 


iexternai{T) = -f n{T) and 


£internal{T) - ^i{T) uiT) 


for t-ary trees are easily proven. Therefore, all distributional results for any one of 
those parameters immediately cover all three. The total p ath l ength turns out to be 

asymptotically normally distributed as well (see Theorem VII), with mean 
3 


and variance 


h-tpin- 


The coefficients have asymptotic expansions 


l^tpi 


t t ^ t{t — 2) 

2 ' C'h- 2t-r4 


+ 0 



and 


(^tpi — Y2~^ 


-f 5t^ - 2f 


3 • 2^+4 


+ 0 ^ 


The path length is studied in Section Its analysis is based on a generating function 
approach for the moments, combined with probabilistic arguments to obtain the central 
limit theorem. 

(5) The number of leaves on the last level (i.e., maximum distance from the root) of a 
tree T G Cxree is denoted by m{T). This corresponds to the number of code words of 
maximum length and to the number of smallest summands in a representation of 1 as 
a sum of unit fractions. 
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This parameter may appear to be the least interesting of the parameters we study. 
However, it is a natural technical parameter when constructing generating functions 
for the other parameters. From these generating functions the probabilistic behaviour 
of m{T) can be read off without too much effort, so we do include these results in 
Section |6l 

The limit distribution of m( T) is a discrete distribution with mean 2t + o(l) and 
variance + o(l), cf. Theorem |Vl[ 

A noteworthy feature of the results listed above is the fact that the distributions we observe 
are quite different from those that one obtains for other probabilistic random tree models. 
Specifically, the parameters differ from the ones of Galton-Watson trees (which include, 
amongst others, uniformly random t-ary trees), but also from the ones of recursive trees and 
general families of increasing trees. See [7] for a general reference. In particular, 

• the asymptotic order of the height of a random Galton-Watson tree of order n is 
only ^/n, and it is known that the limiting distribution (which is sometimes called a 
Theta distribution) coincides with the distribution of the maximum of a Brownian 
excursion |12] . The height of random recursive trees (or other families of increasing 
trees) is even only of order log n, and heavily concentrated around its mean, see |6] . 

• The path length of random Galton-Watson trees is of order , and it follows an Airy 
distribution (like the area under a Brownian excursion) in the limit [26] . For recursive 
trees, the path length is of order n log n with a rather unusual limiting distribution m- 

• While the height of our canonical trees is greater than that of Galton-Watson trees, 
precisely the opposite holds for the width (as one would expect): it is of order ^/n 
for Galton-Watson trees ilEZl, with the same limiting distribution as the height, as 
opposed to only logn in our setting. For recursive trees, the width is even of order 
n/y/logn, see jU]. 

Indeed, the structure of our canonical t-ary trees is comparable to that of compositions: 
Gounting the number of internal vertices on each level from the root, we obtain a restricted 
composition, in which each summand is at most t times the previous one. In the limit t —)• oo 
one obtains compositions of n starting with a 1 in this way. The recent series of papers by 
Bender and Canfield mmm and Bender, Canfield and Gao |1| is concerned with compositions 
with various local restrictions. In fact it would be possible to derive the central limit theorems 
for the height and the number of distinct summands from Theorem 4 in |2], but in a less 
explicit fashion (without precise constants, and further work would still be required for a local 
limit theorem). A parameter related to the “leaf width” (the largest part of a composition) is 
also studied in |1|, but in addition to the fact that the parameters are not quite identical, it 
also seems that the technical conditions required for the main result of |1] are not satisfied 
here. 

Lastly, a remark on numerics and notation. Throughout the paper, various constants occur 
in all our major results, and we provide numerical values for small t as well as asymptotic 
formulae for these constants in terms of t. The error terms that occur in these formulae 
have an explicit O-constant, which is indicated by error functions ej{. .These functions 
have the property that \£j{- . .)| < 1 for all values of the indicated parameters. All results 
were calculated with the free open-source mathematics software system SageMath |24| and 
are available onlin^The numerical expressions were obtained by using interval arithmetic, 

^The worksheets containing the calculations can be found at http://www.danielkrenn.at/ 
unit-frac-parameters-full 
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therefore they are reliable results. Each numerical value of this paper is given in such a 
way that its error is at most the magnitude of the last indicated digit. It would be possible 
to calculate the values with higher accuracy. Determining accurate numerical values and 
asymptotic formulae is not just interesting in its own right, it is also important for some of 
our theorems: specihcally, for all Gaussian limit laws it is crucial to ensure that the growth 
constants associated with the variance are nonzero. We will therefore comment repeatedly on 
how reliable numerical values can be obtained. 


2. The Generating Function 


In this section, we derive the generating function which will be used throughout the article. 

The analysis of the path length (Section also requires results on canonical forests. For 
r > 1, we consider the set of canonical forests with r roots. These r roots are all on the 
same level and ordered from left to right. The notion “canonical” introduced for trees here 
is meant to hold over all connected components of the forest. This means that a forest may 
not be seen as a collection of trees, but rather as the subgraph of a canonical tree induced by 
its vertices of depths > d for some d. In fact, this is also the interpretation for which we will 
need results on forests. We will phrase the generating function in terms of forests, but most 
other results will be formulated for trees only. 

The height h{T), the cardinality d{T) of the set of different depths of leaves, and the number 
m{T) of leaves on the last level of a fores10 T £ of size n = n{T) can be analysed by 
studying a multivariate generating function H{q, u, v, w), where q labels the size n(T), u labels 
the number m{T) of leaves on the last level, v labels the cardinality d{T) of the set of depths 
of leaves and w labels the height h{T). 


Theorem II. The generating funetion 


MT)^MT)d(T)^h(T) 


H{q,u,v,w):= ^ 'u""'-"'v'"'"'w" 

T&Tr 


ean he expressed as 


H{q, u, V, w) = a{q, u, v, w) + b{q, u, v, w) 


a{q,l,v,w) 

1 - b{q,l,v,w) 


with 


a(g, u, u, u;) = ^ n 1|—, 


j=0 


2=1 


b{q,n,v,w) = E n - 

where [jj := 1 + t + • • • + 

The funetions a{q,u,v,w) and b{q,u,v,w) are analytie in {q,u,v,w) when 

1 


kl< 


\u\ 


■ 


( 2 . 1 ) 


( 2 . 2 ) 


^We use the symbol T (instead of F) for a canonical forest in since we usually look at the special case 
r = 1, where T is a tree. 
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When u = 1, the generating function can be simplified to 

a{q,l,v,w) 


H{q, l,v,w) = 


(2.3) 


1 - b{q,l,v,w)' 

The proof of Theorem depends on solving a functional equation for the generating 
function. As we will encounter similar functional equations for related generating functions in 
Section we formulate the relevant result in the following lemma. 

Lemma 2.1. Let P C C 6e the closed unit disc and q € C with |g'| < 1. Let P, R, S, f be 
bounded functions on V and s be a constant such that |5(u)| < s < 1 for all u G V. 

If 

f{u) = P{u) + R{qu^)f{l) + S{qu^)f{qu*) 

holds for all u G P, then 

a(l) 


f{u) = a{u) + b{u) 


1 - 6 ( 1 ) 


holds with 


oo J 

a{u) = ^ P{q^^^u^' )Yl S{q^^u*') 


j=0 


2=1 


1*1 


b{u) = ^ 

j=l i=l 

provided that 6(1) / 1. 

Proof. We iterate the functional equation ( |2.4[ ) and obtain 

f{u) = ak{u) + bk{u)f{l) + Ck{u)f{q^’^K*'°) 


for k > 0 with 


fe-i 


1*1 


ak{u) = J2P{qVK^^)l[S{q^^ 

j=0 i=l 

h(u) = f:R(g>^lu‘-)US(gl‘lu‘'), 

j=l i=l 

Ck{u) = n 5(gWri*'). 

2 = 1 

The assumption |g| < 1 implies that limfc^oo = 0 for |rt| < 1. Therefore, 


lim Ofc(ri) = a{u), lim bk{u) = 6(n), 

k^oo k^oo 


lim Ck{u) = 0, 
k^oo 


for u gP and the functions a{u) and b{u) given in (2.6). 
Taking the limit in (2.4), we get 

f{u) = a{u) + 6(n)/(l) 


for u gP. Setting n = 1 in (2.7) yields (^2.5L 


(2.4) 

(2.5) 


( 2 . 6 ) 


(2.7) 

□ 
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Proof of Theorem\^ The proof of Theorem [lT| follows ideas of Flajolet and Prodinger m, 
see also m- We first consider 

Hh{q, u, v) := [w^]H{q, u, v,w) = Y. 

TeTr 

h{T)=h 

for some h > 0. 

A forest T' of height h + 1 arises from a forest T of height h by replacing j of its m{T) 
leaves on the last level (for some j with 1 < j < m{T)) by internal vertices, each with t leaves 
as its children. If j = m{T), then all old leaves become internal vertices, so that d{T') = d{T). 
Otherwise, i.e., if j < m{T), at least one of them becomes a new leaf, meaning that we have a 
new level that contains one or more leaves, hence d{T') = d(T) + 1. 

For the generating function this translates to the recursion 

m(r)-l 

Hh+i{q,u,v)= Y ( E 

TeTr ^ i=i 
h(T)=h 

= Y 

T&Tr 
h(T)=h 

= R{q, qu\v)Hh{q, 1, v) + S{q, qu\ v)Hh{q, qu\ v), 


qu^y i + (1 _ y)(^qut)^(^)\ 

^ 1-qu^ J 


( 2 . 8 ) 


where we set 

R{q, u, v) = S{q, u, v) 

1 — u 

Note that the initial value is given by HQ{q,u,v) = vJ'v. 
Now set 


1 — V — u 
1 — u 


^0 ■= {{q^u, 0 , 71 :) E I \q\ < 1/5, |?x| < 1, |n — 1| < l/f),\w\ < 1}. 

We note that if (g, n, v, w) E Pq) we have 

|i^(g,g'U^n)| < ^, \S{q,qu\v)\ <^. 

This and (^) imply that \Hh{q,u,v)\ < (6/5)(4/5)^ holds for /i > 0 and {q,u,v,w) E Vq. 
Thus H{q, u, v, w) = J2h>o ^h{q, u, v)w^ converges uniformly for (gr, n, n, w) E Vq. 

Multiplying (2.8) by and summing over all h > 0 yields the functional equation 

H{q, u, V, w) = V + wR{q, qvf, v)H{q, 1, n, w) + wS{q, qvf, v)H{q, qvf, v, w). 


Lemma 2.1 immediately yields ( |2.1| ). 

Let now 

Vi = {{q,u,v,'w) E I \qv!'~^\ < 1}. 

We clearly have Vq C Vi. For {q, u, v, w) E Pi, we have 

hm = hm (?-Vh-i)(gi/(t-i)^)*'' =0. 

k^oo k^oo 

Therefore, a{q,u,v,w) and b{q,u,v,w) are analytic in Pi. 


□ 


In the following lemma, we also state a simplified expression and a functional equation for 
b{q, u, V, w) in the case v = 1, w = 1. 
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t 

<7o 

Q 

2 

0.5573678720139932 

0.7131795784312742 

3 

0.5206401166257250 

0.6307447647757403 

4 

0.5090030531391631 

0.5930691701039086 

5 

0.5042116835293617 

0.5720078345052473 

6 

0.5020339464245723 

0.559428931713329 

7 

0.5009982119507272 

0.550735002693058 

8 

0.5004941016343997 

0.544259198784997 

9 

0.500245704703080 

0.539248917438516 

10 

0.5001224896234884 

0.535257359027998 


Table 1. Constants qo and Q for 2 < t < 10. For the accuracy of these 
numerical results see the note at the end of the introduction. 


Lemma 2.2. We have 


b{q,u,l,l) = IT' 


qu 


:(1 - b{q,qu\l,l)). 


j-i 1 — 1 — qvh 

In partieular, the eoeffieient [u^]b{q,u,l,l) vanishes if j is not a multiple oft. 


Proof. This is an immediate consequence of (2.2). 


□ 


Next we recall results on the singularities of H{q, 1,1,1), see Proposition 10 of [H]. We 
use functions Sj for modeling explicit 0-constants as it was mentioned at the end of the 
introduction. 

Lemma 2.3. The generating function H(q,1,1,1) has exactly one singularity q = qo with 
|g| < 1 — This singularity qo is a simple pole and is positive. For t > 4, we have 

_ 1 1 t + 4: 3t‘^ + 23t + 38 7t^ 

® - 2 + ^ + f2t+5 + 23^+8 ^ 100 •24*^^^^^' 

For t G {2,3}, the values are given in ra&le[^ Furthermore, let 

^ 1 log 2 0.06 

^“2 + ^ + 1^ 

for t > Q and Q be given by Table\^for 2 < t < 5. Then qo is the only singularity q of 
H{q, 1,1,1) with |g| < qo/Q- 

Setting U = 1 — for t > 2 and U = 1 — for t = 2, we have the estimate 




(2.9) 


These results do not depend on the choice of the number of roots r. 


Proof. By m Proposition 10], the function 1 — b{q, 1,1,1) has a unique simple zero q = qo 
with |g| < 1 — 0.72/t and no further zero for |g| < qo/Q] the asymptotic estimates for qo and 
Q follow from the results given in m- 
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At this point, we still have to show that the numerator does not vanish in go- We note that 
Qo < 3/5. Using jTTJ Lemma 8], we obtain 


|a(go, 1, 1 , 1) - 1| < + Y. 11 

i — On r, . 


J*1 

% 


9 83038203 

< — + _ < 1 . 


Qo 


i=2 


=1 1 - ~ 10 903449750 


Therefore, 


holds uniformly in r. 


1,1,1) = 0(1) 


( 2 . 10 ) 


For t > 30, the estimate (2.9) follows from the asymptotic expressions. For t < 30, it is 
verified individually. □ 


Using this result, we will be able to apply singularity analysis to all our generating functions 
in the coming sections. At this point, we restate Theorem |I] on the number of trees taking the 
notations of Theorem |IT] into account and extend it to the number of canonical forests with r 
roots. 


Lemma 2.4. For r > 1, let 


p(r) = 


«(®, 1,1,1) 


qoigKq^ i, i, i) 


9=90 


where a{qo, 1,1,1) is taken in the version with r roots. 

Then 

v{r) = 0(1) 

uniformly in r >1 and the number of eanonieal forests with r roots of size n is 

u{r) 


qE 


-(1 + 0(Q")), 


( 2 . 11 ) 

( 2 . 12 ) 

(2.13) 


also uniformly in r > 1. 

Proof. By singularity analysis [13 US], Lemma |2.3| and Theorem the number of canonical 
forests with r roots of size n is 


— Res 




,,n+l 


q = qo] +0 


v{r) 


+ 0 


(2.14) 


The O-constant can be chosen independently of r as a{q, 1,1,1) can be bounded independently 
of r for |g| = qp/ Q. 

The estimate (2.10) immediately yields (2.12). Combining this with ( |2.14 ) yields (2.13). □ 

When analyzing the asymptotic behavior of the height (Section]^, the number of leaves on 
the last level (Section]^ and the path length (Section]^, the corresponding formulae contain 
the infinite sum h{q, u,l,w) and its derivatives. In order to perform the calculations to get 
the asymptotic expressions in t as well as certihable numerical values for particular t, we will 
work with a truncated sum and bound the error we make. We define 


hj{q,u,l,w) = - Y (-l)'^'n 

l<j<J i=l ” 


Note that the variable v encoding the distinct depths of leaves is handled separately in 
Lemmata 12.71 and 12.81 

The following lemmata provide the estimates we need. 
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Lemma 2.5. Let J e N and q, u, w G C with \qu^ ^| < 


Q = I'w^l 


kl 


y+n 


i-kl 


[^+11 


and suppose that Q <1 holds. Then 


1 I In'** 

\b{q,u,l,w) - hj{q,u,l,w)\ < \w\-^ .-— , 

' *=i 1 - gWii*' 


1 


l-Q' 


Note that as ^| < 1, the error bound stated in the lemma is decreasing in J. 
Proof of Lemma 2^. Set 

^ . .X oWii** 

R = b{q, u, 1, in) - bj{q, u, l,w) = - ^ 

j>j 1=1 " 


T i — 
2=1 ^ 


As is decreasing in i, we have 


w 


TT" 

}J ■ * 




< b/iP 




= \w\-^Q^-Jr[ 

1=1 1 — gWtt** 




n 




kr*“ \u\ 


for j > J. This leads to the bound 


i^i<kK n 




^ |l — qhly' 
which we wanted to show. 


EQ^-^=HMn 




j>J 


:=1 1 — gWtt*’ 


l-Q’ 


□ 


We also need to truncate the infinite sums of derivatives of b{q, u,l,w). This is done by 
means of the following lemma. 

Lemma 2.6. Let J G N and a, /3, 7 G No, and let q G C with |q| < |. Suppose 
• either u = 1, U = 1 and /d = 0, 

or u G C with |m| < 1/U — for U defined in Lemma 


2.3 


holds. Further, let w G C with |u;| < |. Set 

Q = l 


3 ^ 6 ^ ['^+11 jyl-l+l _ 2 
and suppose J was ehosen sueh that Q <1 holds. Then 

Qa+l3+'Y 


<a!/3!7!(*VlogV2)'’6“+T(^0''(n 


t"! - 1 ) l-Q’ 
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Proof. Let i? £ C with |i?| < l/?7 and r/ £ C with \r]\ < Cauchy’s integral formula gives 




(e - 


de 


The bound on q implies |^| < Using the standard estimate for complex integrals, (2.9) and 


Lemma |2.5| yield 

ga 


dq^ 


(6(g,i?,l,T/) - bj{q,^,l,r])) 


< al 6 



'Ji -ij ^-Q' 


Note that the right hand side is independent of q, i? and r/, and, as J tends to infinity, this 
bound is going to zero. Therefore, for fixed i? and r], the series 6 (g, i?, 1, 77 ) converges 


uniformly on the compact set {q \ |( 7 | < |}. Thus, for i? with | 7 ?| < 1/U and rj with |? 7 | < |, 
this function is analytic. Note that this result stays true if -d = 1 and 17 = 1. 

We use Cauchy’s integral formula again and obtain 

-§^{b{q,i}, l,r/) - bj{q,'&, l,r])) 


di 


dw'^ dq^ 


{b{q,i9,l,w) - bj{q,i},l,w)) 


fV 

2,7ri J\n—w\=^ 


dq° 


{rj - r(;)'T'+i 


‘ dr/ 


< 7 ! 6 "^ 


ga 

dq^ 


(6(g,r?, 1,7/;) - 6j(g,7?, l,7u)) 


ri 


=\ -i) ^-Q' 

Note that |7/;| < | implies \r]\ < |. Moreover, g^cyg^-y b{q, 7 ?, 1,7/;) is analytic in 7 ? with |7?| < 1/17. 
Again, this result stays true if 7 ? = 1 and 17 = 1. 

Using Cauchy’s integral formula once more yields 


ga g^ g-y 
dq°‘ dul^ dw'^ 


{b{q,u, l,7c) - bj{q,u, l,7c)) 


/ 3 ! 


27ri 7|^_.„|=i2s^ 


</3! (iVlogv/2)^ 


1,'«^) - bj{q,T},l,w)) 
( 7 ? - 7 /)/^+i 

{b{q,u,l,w) - bj{q,u,l,w)) 




ga g^ 


□ 


5^“ dw'^ 

which is the desired result after inserting the bound from above 

In Section]^ we analyze the distinct depths of leaves. Again, we work with infinite sums by 
replacing them with finite sums and bounding the error we make. Similar to the estimates 
above, we define 

6 j(g, 1,77,1)= 5 : 

^ ^ , 1 - oLl J-J- 1 - oW 

i<j<J ^ *=i ^ 

and have the following two lemmata. 

Lemma 2.7. Let J £ N, g £ C with \q\ < 1 and v € C. Set 


Q = \q\' 1 + 


1-kl 


[■/I 
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and suppose Q < 1 holds. Then 


\KqA,vA) -bj{q,l,vA)\ < 


kl 


[Ji /j-i 


Proof. Set 


|l — I y 
R = b{q, l,v, 1) - hj{q, 1, v, 1) = ^ 


n 1 + 


i=l 


1 - qlil III 1-Q' 


vqVl gl* 


n 


, 1 - I*--*: 1-gW 

j>j ^ 1=1 ^ 

Let j > J. We have 

IjI = [I«^l + ~ <^1 > [[«^1 + {j — J)- 

Therefore, for j > J we obtain 


1 - r; - gW 


Jj} TT 

^ l-qM 

1 = 1 ^ 

This leads to the bound 




1-kl 




j-JJ-l / 

I nb+ 


2=1 


1 - gW I 


1^1 < 


which we wanted to show. 


bl kl 


[JI /j-i 


1 — I 


n 1 + 


^ 2=1 


1 - gW I 


E 


j>j 


□ 


The result of the previous lemma can be extended to derivatives, see below. The proof is 


skipped as it is very similar to the proof of Lemma 2.6 


Lemma 2.8. Let J G N, a G No and 7 G Nq. Further, let q ^ C with |g| < | and v £ C with 
|u| < |. Set 


Q = 


1 + 


i-(i) 

and suppose J was ehosen sueh that Q <1 holds. Then 


5\['^1 


ga+'y 




(6(g, l,u, 1) -bj{q,l,v,l)) 


<a!7!6"+^-- 

( )W _ 1 


J-l 

n|i+ 


i-(i) 


5^ W 


1-Q 


3. The Height 


We start our analysis with the height h(T] of a canonical tree T G T. It turns out that 
the height is asymptotically (for large sizes n = n{T)) normally distributed, and we will even 
prove a local limit theorem for it. Moreover, we obtain asymptotic expressions for its mean 
and variance. This will be achieved by means of the generating function H{q, u, v, w) derived 
in Section |2j 

So let us have a look at the bivariate generating function 

^n(T)wh{T) ^ a(g, 1,1, w) 


H{q, 1, l,rc) = q^'- 

Ter 

for the height. We consider its denominator 


1 - 6(g, 1, l,u;) 


D{q,w) := I - b{q,l,l,w) = Yl ^ 


j>0 


J-J- 1 _ ' 

2 = 1 ^ 
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From Lemma 2.3 we know that D[q, 1) has a simple dominant zero qq. We can see the 
expansion of D{q,w) around {qo, 1) as perturbation of a meromorphic singularity, cf. the book 
of Flajolet and Sedgewick m Section IX.6]. This yields a central limit theorem (normal 
distribution) for the height without much effort. But we can do better: we can show a local 
limit theorem for the height. The precise results are stated in the following theorem. 

Theorem III. For a randomly chosen tree T of size n the height h{T) is asymptotically 
(for n —)■ ooj normally distributed, and a local limit theorem holds. Its mean is pL^n + 0(1) 
and its varianee is afji + 0(1) with 


go^&(g,l,l,l)lg=go 

_ 1 ^ t-2 ^ 2t‘^ + 3t-8 ^ 9t^ + 45t2 + 2t - 
~ 2 2^+3 2^t+5 ' pi+8 


(3.1) 


0.55t^ , , 


and 


for t >2. 


_2 1 -O + 5t - 2 -40 

^h-^ + • ^ ^ + — 


40 + 27t - 14 0.26O , , 


221+6 


231 


Recall that “randomly chosen” here and everywhere else in this article means “uniformly 
chosen at random” and that the error functions £j{...) are functions with absolute value 
bounded by 1, see also the last paragraph of the introduction. 

We calculated the values of the constants fih and numerically for 2 < t < 30. Those 
values can be found in Table Figure |TT] shows the result of Theorem III It compares 


the obtained normality with the distribution of the height calculated for particular values in 
SageMath. 


t 

Ph 


2 

0.5517980333242771 

0.3191028720021838 

3 

0.5330219170893142 

0.2640876574238174 

4 

0.5216130806307567 

0.2465933142213578 

5 

0.5137644952434437 

0.2404182939877220 

6 

0.5084950082062925 

0.2396633993742431 

7 

0.5051047365215813 

0.2411570855092153 

8 

0.5030001253275540 

0.2432575483836212 

9 

0.5017308605343554 

0.2452173961787762 

10 

0.5009832278618640 

0.2467757623911673 


Table 2. Numerical values of the constants in mean and variance of the height 


for small values of t, cf. Theorem III See also Remark 3.1 For the accuracy of 


these numerical results see the note at the end of the introduction. 


Remark 3.1. For the (central and local) limit theorem to hold, it is essential that cr^ ^ 0, 
which is why we need reliable numerical values and estimates for large t. As mentioned earlier, 
we used interval arithmetic in SageMath [21] in all our numerical calculations to achieve such 
results. We used a precision of 53 bits (machine precision) for the bounds of the intervals. All 
values are calculated to such a precision that the error is at most the magnitude of the last 
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height 


•10 


-2 



true values 
■ Theorem ITIT 


height 


Figure 3.1. Distribution of the height for t = 2, and n = 30 (top figure) and 
n = 200 (bottom figure) inner vertices. On the one hand, this figure shows the 
true distribution of all trees of the given size and on the other hand the result 
on the asymptotic normal distribution (Theorem III with only main terms of 
mean and variance taken into account). 


digit that occurs. The reason for the varying number of digits after the decimal point (in, for 
example. Table are numerical artifacts. In these cases, we could have given an additional 
digit at the cost of a slightly greater error (twice the magnitude of the last digit). 

The proof of Theorem m is split up into several parts. At first, we get asymptotic normality 
(central limit theorem) and the constants for mean and variance by using Theorem IX.9 
(meromorphic singularity perturbation) from the book of Flajolet and Sedgewick [15] . For the 
local limit theorem we need to analyze the absolute value of the dominant zero qo{w) of the 
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denominator D{q,w) of the generating function H{q, 1, l,u>). Going along the unit circle, i.e., 
taking w = e*‘^, this value has to have a unique minimum at <y9 = 0. 

From the combinatorial background of the problem (non-negativity of coefficients) it is 
clear that |go(e*'^)| > |(?o(l)|- The task showing the uniqueness of this minimum at y? = 0 is 
again split up: We show that the function | is convex in a region around y? = 0 (central 

region), see Lemmata 3.3 to 3.5 For the outer region, where (p is not near 0, we show that 


zeros of the denominator are larger there. This is done in Lemma 3.2 


Those lemmata mentioned above showing that the minimum is unique work for all general 
t > 30. For the remaining t, precisely, for each t with 2 <t < 30, the same ideas are used, but 
the checking is done algorithmically using interval arithmetic and the mathematics software 
system SageMath j24] . Details are given in Remark 3.7 


So much for the idea of the proof. We start the actual proof by analyzing the denominator 
D{q,w). For our calculations we will truncate this infinite sum and use the finite sum 


Dj{q,w):= (-l)^u;^ 




0<j<J 


1 — gW 


instead. Bounds for the tails (difference between the infinite and the finite sum) are given by 
Lemma |2.5| In particular, we write down the special case J = 2 of this lemma, which will be 
needed a couple of times in this section. Substituting 1/z for q, we get 

\D{l/z,w) -D2{1 /z,w)\ < \w\^ I, \ I 7 -^- 7 -;:. . -W’ (3-2) 


\z - 1 


-ll-\w\/{\z\ 


- 1 )' 


under the assumption |tc| < \z 




— 1. Derivatives of D{q,w) are handled by Lemma 


2.6 


As mentioned earlier, the proof of the local limit theorem for the height for general t consists 
of two parts: one for w in the central region (around u) = 1) and one for w in the outer region. 
The following lemma shows that everything is fine in the outer region. After that, a couple of 
lemmata are needed to prove our result for the central region. 

Lemma 3.2. Let w = where (p is real with ^97/96 tt2“*/^ < \p\ < vr. Then each zero of 
z !->■ D{l/z,w) has absolute value smaller than 2 — 1/2*. 

Proof. Suppose that we have a zero zq of the denominator D{l/z,w) for a given w and that 
this zero fulfils \zo\ > 2 — 1/2*. We can extend the equation D{l/zo,w) = 0 to 


w 


0 = 1-- D{l/zo,w) - D2{1 /zo,w) , 

zq — 1 


which can be rewritten as 


zo = l + w-{zQ-l) {D{1/zq,w) - D2{1/zq,w)) . 


Taking absolute values and using bound (3.2| obtained from Lemma 2.6 yields 

1 


\zo\ < |1 -I- tel -I- 


1 


i2l 


- 1 


i-i/(i^or"“-i) 


We have the lower bounds 


i2l 


- 1 


> \zo 




- 1 > 2 ' 
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and 


.[31 _ . 

Zq 1 


> - 1 > (^1 - 


* +*+i 807159 , 

- 1 > _ > 49, 


16384 

which can be found by using monotonicity and the value at t = 2. Therefore, we obtain 

49 1 

\zo\<\l + w\ + --^. (3.3) 

Since we have assumed |zo| > 2 — 1/2*, we deduce 

, , 97 1 

|1 + «^|> 2 --^. 

On the other hand, using \ip\ > y*97/96 7r2“*/^ and the inequality |sin((^/4)| > \ip\ /(\/27r) for 
|(/3| < vr (which follows by concavity of the sine on the interval [0, |]), we have 


|1 + u;| = \/2 + 2 cos ip = 2 ^1 — 2 sin^ 
which yields a contradiction. 


2 9 97 1 

<2 - <2 - 7 , 

7r2 ^ 48 2* ’ 


□ 


Next, we come to the central region. Looking at the assumptions used in Lemma 3.2 this 
is when |y?| < \/97/96 vr 2“*/^. As mentioned in the sketch of the proof, we show that the 
function \qo{e^'^) \ is convex. 

We know the location of the dominant and second dominant zero of the denominator D{q, 1). 
As we need those roots for general w (along the unit circle), we analyze the difference of 
D{q,w) from D{q, 1). Using Rouche’s theorem then yields a bound for the dominant zero, 
which is stated precisely in the following lemma. 

Lemma 3.3. Suppose t >5 and Ire — 1| < g Then q i—)■ D{q,w) has exactly one root 

with |(?| < I and no root with l^l = |. 

Proof. We use Rouche’s theorem on the circle |( 7 | = |. With |u;| < |, Ig'l = | and the 
bound 


(obtained from Lemma 2.6) we get 


9 


\D{q,w) -D 2 {q,w)\ < - 


1 


1 


< 3.29 (1) =6, 


2 (3/2)1+* _ 11 _ (3/2)/((3/2)1+*+*" - 1) 

where we took out the factor (2/3)* and used monotonicity together with the value for t = 5. 
With D 2 {q,w) = 1 — wq/{l — q) we obtain 

\D{q,w) - D{q, 1)| < \D{q,w) - D 2 {q,w)\ + \D 2 {q,w) - £> 2 ( 9 ,1)| + \D 2 {q, 1) - D{q, 1)| 


< 26 + Ire — 1| 


q 


l-q 


<2b + 2\w-l\<l + 2b-l0{-] <1-6. 


On the other hand, the Mobius transform q 1 — q/(1 — q) maps the circle \q\ = 2/3 to the 
circle \z — 1/5| = 6/5. Therefore |1 — q/{l — <?)| > 1, and so we have 




1 - 


l-q 


- \D{q,l) -D 2 {q,l)\ >1-6. 


This proves the lemma by Rouche’s theorem and Lemma |2.3| 


□ 

The previous lemma gives us exactly one value qo{w) for each re in a region around 1. We 
continue by showing that this function qQ is analytic. 
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Lemma 3.4. For t > 5 and Iw — 1| < ^ — 5(|)*, the function qo{w) given implicitly by 
D{qQ{w) ,w) =0, |(?o(wi)| < is analytic. 

Proof. We follow the lines of the proof of the Analytic Inversion Lemma, cf. Flajolet and 
Sedgewick m. Chapter IV.7. Consider the function 

CTiiw) = - ® ■ V, -r— qdq. 

27rz/|g|=| D{q,w) ^ 

Since D(q, w) ^ 0 for all q and w allowed by the assumptions, this function is continuous. 
Moreover, using the theorems of Morera and Fubini, and Cauchy’s integral theorem, the 
function fii is analytic. By Lemma |3.3| and by using the residue theorem we get that cri{w) 
equals q fulfilling D{q,w) = 0 and \q\ < |, i.e., we obtain ai{w) = qo{w). □ 


Since we have analyticity of qo in a region around 1 by Lemma |3.4[ we can show that small 
changes in w do not matter much, see the following lemma for details. Later, this is used to 
estimate the derivative at some point w by the derivative at 1 . 


Lemma 3.5. Let t >30 and w = e^'^, where (p £ 
inequalities 


with \p\ < y^97/96 7r2 IFe have the 


■i 17 102 

\qo{w) - qQ{l)\< ko(w^) - 9o(l)| < ^ and |go(«^) “ 9c)(l)| < 


Proof. Set d = 5 — 5 (i) . By Lemma 


3.4 


Therefore, by Cauchy’s integral formula, we 

k\ 


- qi\l) = ^. i ( 

2m J\c-i\=d V 


the function qo{w) is analytic for lie — 1 | < d. 
■get 

qoiO qoiO 


|C-i|=d 




dC 


(k) 

for /c £ No, where gg ^ denotes the k-th derivative of q^. For its absolute value we obtain 


qi^\w) - qll'\l) < kid max |go(C)| max - w) - (C - 1) 

|C-i|=rf IC-iNc* 

We have 1(70(01 < § by Lemma 3.3 Further, we get 


< liu — 1| (A: + 1) max |C — 


Since 


1^ - 1| < h - 1| = - 1| < 




< \P\ 


we have |C ~ > d — \(p\. Collecting all those results, and using d < ^ and the bound given 

for \p\ results in 

\ -(fc+2) 


q^Q\w)-qf\l) 


< 


{k + l)\ 


1 


u;-l| I X -5 ( - 1 - i/^7r2 


/97 


y-t/2 


Inserting all bounds gives the estimates stated for k £ {0,1,2}. 


□ 
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3.5 


to 


Now we are ready to show that the second derivative of is positive. To do so, we 

show that this second derivative is around | for (/? = 0 and use the bounds of Lemma 
conclude positivity for w in some region around 1. 

Lemma 3.6. If t > 30 and y? G M with \ip\ < a/97/96 vr then 

d^ / .-..N 2 


Proof. Write 


d 


d (/?2 


q=qo{w) 


qo 




> 0 . 


and 




q=qoiw) 


and analogously Agg, Ag^ and for the function D{q,w) derived twice and then evaluated 
nserti 

q'oiw) = - ‘ 


at q = qo{w). By inserting the asymptotic expansion of qo, see Lemma 2.3 into the expressions 

A„ 


A„ 


and qo(^) = 


9A A A — A — A A^ 

^qq^w ^ww^q 




(3.4) 


obtained by implicit differentiation, we find 

//.A 1 0.07t 

9o(l) = “I + and 

For the calculations themselves, we used the approximation Do{q,w) of the denominator 


1 0.04t2 

9o(l) = 1 + 


D{q,w) together with the bound for the tail give n in Lemma 2.6 
Set w = e*‘^. Using the bounds of Lemma 3.5 yields 


6 


90 ( 

9o( 


n I 

9ol 


- 2+ 

18 


'b’d = 5 

,( j \ 1 103 

i + 

We define x{ip) and ?/((/?) to be the real and imaginary parts of qo{e^‘^), respectively. Thus 

x{p)Piy{p) = 90 , 


and 


x'{p) + i y'{p) = ie^'^ 9o 

x"{p) +iy"(¥?) = -e*‘^9o(e*‘^) 


-e^*‘"9o 


Fd ■ 


Then, the estimates above lead to 

, , 1 6 . . 

= 2+ 

19 

’ 

124 


//A 1 19 

2/(</^) = -4 + ^ei2(t), 

124 

/(7^) = ^U4(t). 
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These in turn together with 


give us the second derivative 

d2 


'qo(/'^) + y'{(pf+ x{(p)x"{(p) + y{ip)y"{ip)) 


(3.5) 


dip^ 


qo 




ip=0 


1 144 

= 8+ > 0.1206, 


which is what we wanted to show. 


□ 


Remark 3.7. The ideas in this section presented so far can also be used to show the uniqueness 
of the minimum of \qo{e'^‘^) | at (/? = 0 for a fixed t. In particular, this works for t < 30, where 
some of the results above do not apply. 

For the calculations the mathematics software system SageMath |24] is used. Further, we 
use interval arithmetic for all operations. The checking for fixed t is done in the following way. 
We start with the interval [—4,4] for ip. In each step, we check if the second derivative (using 


Equations (3.4) and (3.51) is positive. If not, then we half each of the bounds of the interval 


and repeat the step above. When this stops, we end up with a region around 0 that is convex. 
For its complementary, we now use a bisection method to show that |go(e*‘^)| > |(?o(l)|- Note 
that we can use an approximation Dj{q, w) instead of the denominator D{q, w), which can be 


compensated taking the bounds obtained in Lemma 2.6 into account. 


For 2 < t < 30, those calculations were done with a positive result, i.e., the minimum at 
(/? = 0 is unique. 

Now we have all results together to prove the main theorem of this section. 

Proof of Theorem m We use Theorem IX.9 of Flajolet and Sedgewick m and apply that 
theorem to the function H{q, 1,1, w). This gives us the mean and the variance and as a central 
limit asymptotic normality. In particular, we obtain 

^ ^ [q-]H{q,1,1,1) • 

_ a{q,'^A,'^) ^KqAA,w)\i,=i ^a{q,l,l,w)\^=i 
(l-6(g,l,l,l))2 l-b{q,1,1,1) 

By singularity analysis, we can extract the asymptotics to get the linear behavior of this mean 


By (2.3), we have 




and in particular the constant (3.1) 


For the local limit, we need a more refined analysis. Recall the notation D(q, w) as the 
denominator of H{q, 1, l,w) and let qo{w) be given implicitly by D{q(){w) ,w) = 0, |(j'o(w^)| < Ij 


according to Lemma 2.3 and to Lemma 3.3 Set qo = go(l) a-nd 


Qa+'y 

dq°‘dw'^ 


D{q,w] 


q = go,w = i 

Then we obtain the asymptotic formula + 0(1) for the mean, with 

coi 


Rh — 


cioqo 
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and the variance is ajji + 0(1) with 


2 _ CoiC2ogo + Coicfo^O “ 2 CoiCioCiigo + C02 CioQ'0 + CqiCio 

^ 4oii 


To calculate the coefficients c, 
with infinite sums, we use the approximations Dj{q,w) 


a-f we need derivatives of D(q, w). In order to avoid working 


Lemma 2.6 shows that the error 


made by using those approximations is small. For the calculations themselves, SageMath 
was used. 

To show the local limit theorem, we have to show 


go(e*^)|>ko(l)| 


for all non-zero ip E [— tt, tt ], cf. Chapter IX.9 of m- 


Let t > 30. Lemma 


3.6 


states that 1^(6*“^) | is convex for \ip\ < y^97/96 7r2“^/^, therefore 
the minimum at (/? = 0 is unique for these (p. 

For all other ip, the value of \qo{e''^)\ is greater than 1/(2 — 1/2*) > 1/2 -|- 1/2*+^ by 
This value itself is greater than i + 0.1251/2* > |(7o(l)|. Therefore the minimum 


3.2 


Lemma 

at (/? = 0 is unique and the local limit thorem follows for t >30. 

When t < 30, we use an algorithmic approach to check that the minimum at 99 = 0 is 
unique. The details can be found in Remark 3.7 □ 


4. The Number of Distinct Depths of Leaves 


In this section we study the number of distinct depths of leaves d{T) of a canonical tree 
T motivated by the interpretation as the number of distinct code word lengths in Huffman 
codes. This parameter is also asymptotically normally distributed, and we show a local limit 
theorem. The approach is essentially the same as for the height. It is based on the generating 
function H{q,u,v,w) from Section]^ To analyse the parameter d{T), we look at the bivariate 
generating function 


Ter 


a{Q, 1) 

1 -b{q,l,v,l) 


for the number of distinct depths of leaves. Again, we consider its denominator 


I — y — 

D(q,v) := 1 - 6(,, 1,1) = 1 - 1 : n — r ^pr 

l<j ^ i=l ^ 


and proceed as in the previous section. Lemma 2.3 tells us the existence of a simple dominant 
zero go of D{q, 1). Again, we expand the denominator D{q,v) around {qo, 1) and use Theorem 
IX.9 from the book of Flajolet and Sedgewick m to get asymptotic normality. The local 
limit theorem follows from considerations of the dominant zero of D{q,v) with v on the unit 
circle. This results in the following theorem. 


Theorem IV. For a randomly chosen tree T € F of size n the number of distinct depths of 
leaves d{T) is asymptotically (for n —)■ 00 / normally distributed, and a local limit theorem 
holds. Its mean is pdn + 0(1) its varianee is a'^n + 0(1) with 

_ 1 ^ t-4 ^ -t-U ^ 9t^ + 27t‘^ - 76t - 144 ^ 0.06f^ _ 

~ 2 2*+3 2^1+5 ' pI+8 ' 24* ^16 (d 
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and 

2 1 -t‘^ + 9t- 14 -4t3 + 20^2 + 3i - 54 O-OSei-^ 

- 4 + ^ + ^2iT6 ^ 

for t >2. 


Again, as in the previous section, we calculated the values of the constants fid and ci^ 
numerically for 2 <t < 30, and they are given in Table Figure 4.1 visualizes the result of 
Theorem |IV| as in the previous section. 


t 

ti-d 


2 

0.4151957394337730 

0.2449371766120133 

3 

0.4869093777539261 

0.2893609775712220 

4 

0.5024588321518999 

0.2741197923680785 

5 

0.5050331956677906 

0.2607084483093273 

6 

0.5043408269340902 

0.2530808413006747 

7 

0.5030838633817897 

0.2495578056054622 

8 

0.5020050053196332 

0.2483362931739359 

9 

0.5012375070905982 

0.2482103208441571 

10 

0.5007377066674932 

0.2485046286268308 


Table 3 . Values of the constants in mean and variance of the number of distinct 


depths of leaves for small values of t, cf. Theorem IV See also Remark 3.1 For 


the accuracy of these numerical results see the note at the end of the introduc¬ 
tion. 


As mentioned above, the proof of Theorem IV works analogously to the proof of Theorem |III| 
It is again spread over several lemmata. There is a one-to-one correspondence of Lemmata |4.1| 


to 4.5 to Lemmata 3.2 to 3.6 in the section for the height parameter. Due to their similarities, 
the proofs are skipped a couple of times and only some differences (for example, the different 
constants) are mentioned. The idea of the proof itself is described in the previous section 
below Theorem imi 

To show Theorem IIVI it is convenient to work with the finite sum 


vq 


Dj{q,v)-.= l- 

^<j<J ^ * = 1 




1 - gW 


instead of the denominator D{q, u) = 1 — b{q, 1, u, 1). The error made by this approximation 
was analyzed at the end of Section namely in the two lemmata |2.7| and |2.8[ 

For the local limit theorem, we split up into the central region around v = 1 and an outer 
region. The following lemma covers the latter one. 

Lemma 4.1. Let v = where ip is real with 2 tt 2~^^‘^ < \ip\ < vr. Then each zero of 
z !-)• D{l/z,v) has absolute value smaller than 2 — 1/2*. 


The proof goes along the same lines as the proof of Lemma 3.2 but we get the bound 

7 


\zo\ < |1 -I- -f 


2 * 


instead of (3.3). 


Next, we go on to the central region. As a first step, we bound the location of the dominant 


zero. 
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true values 
■ Theorem ITV 


•10 


-2 



true values 
■ Theorem ITV 


Figure 4.1. Distribution of the distinct depths of leaves for t = 2, and n = 30 
(top figure) and n = 200 (bottom figure) inner vertices. On the one hand, this 
figure shows the true distribution of all trees of the given size and on the other 


hand the result on the asymptotic normal distribution (Theorem IV with only 
main terms of mean and variance taken into account). 


Lemma 4.2. Suppose t> A and |u — 1| < | — 5(|) , then q e-)■ D{q, v) has exactly one root 
with |(?| < i and no root with |(;| = i. 


This lemma is proven analogously to Lemma 3.3 The only difference is the bound 

t 


|L>(g,u) - D2(g,u)| < 3.09 =b, 


which is valid for t > 4. 
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Lemma 4.3. For t > 4 and — 1| < ^ — 5(|)*, the function qo{v) given implicitly by 
D{qo{v) ,t>) = 0, |(7o('i^)| < Is analytic. 

The proof of this analyticity result is the same as the one from Lemma 3.4 therefore skipped 
here. 

In the central region around u = 1, small changes in v do not change the location of the 
dominant zero much, which is made explicit in the lemma below. 

Lemma 4.4. Let t > 30 and v = where £ M with \tp\ < 27r2 then 

Q .34 202 

ko(^') - 9o(l)| < ^, \(^q{v) - q'Q{l)\< ^ and |g()'(u) - go(l)| < 

Again, the proof works analogously to the proof of the corresponding lemma for the height 
parameter. 

In order to prove the local limit theorem we show that the second derivative of |^o(e*‘^)| is 
positive. This is stated in the following lemma. 

Lemma 4.5. If t> 30 and y? G M with \ip\ < 27r2“*/^, then 

d2 


d (/?2 


90 




> 0 . 


We use the proof of Lemma [34)] and update the constants. 

For a fixed t we can use the mathematics software system SageMath 


and perform 


calculations with interval arithmetic. The details, which are stated for the height in Remark 3.7 
remain valid. For integers t fulfilling 2 < t < 30 we showed that \qo{e^'^) \ has a unique minimum 
at (^ = 0. 

The proof of Theorem |IV| follows by the same arguments as the proof of Theorem |IIIt We 
use Theorem IX.9 of Flajolet and Sedgewick m applied to the function H{q, l,v, 1) to get 
mean and variance (and asymptotic normality as a central limit, too). For the local limit 


theorem the uniqueness of the minimum of 


is shown by a two-fold strategy. The 
(using previous lemmata as 


4.5 


central region with |(^| < \/3 7r2“*/^ is covered by Lemma 
prerequisites). Lemma 4.1 discusses the outer region. For t < 30 the algorithmic approach 
above is used. 


5. The Width 

In this section, we consider the width, i.e., the maximum number of leaves on the same 
level, for which we have the following theorem. 

Theorem V. For a randomly chosen tree T € F of size n, we have 

E{w{T)) = logn-|-0(loglogn) 
for the expectation of the width w{T), where pL^ is given by 

1/1 1 0.2t \ 

“ DA + 4,2.Lg5 2 + 

for t > 10. For 2 <t <9, the values of fiw o,re given in Table^ 

Furthermore, we have the concentration property 

¥{\w{T) - iJ,^logn\ > apt^iloglogn) = o(- — (5.1) 

Vlog nj 


for a > 2. 
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t 

f-^W 

2 

1.710776751014961 

3 

0.7660531443158307 

4 

0.4936068552417457 

5 

0.3650919029615249 

6 

0.2902388863790219 

7 

0.2411430286905858 

8 

0.2063933963643483 

9 

0.1804647899046739 

10 

0.1603561167643597 


Table 4. Numerical values of the constants //lu for 2 < t < 10, cf. Theorem [V| 


See also Remark 3.1 For the accuracy of these numerical results see the note 


at the end of the introduction. 



0.4 


0.3 

>1 


ce 

O 

0.2 

ft 

0.1 


• true values 
— expectation 


5 10 15 

leaf-width 


Figure 5.1. Distribution of the leaf-width for t = 2 and n = 100 inner vertices. 
On the one hand, this figure shows the true distribution of all trees of the given 
size and on the other hand the result on the expectation of this distribution 
(Theorem [V| with only main term of mean taken into account). 


In Figure 5.1 one can find the distribution of the leaf-width for a given parameter set 
together with the mean found in Theorem [V| 

First, we sketch the idea of the proof. We consider trees whose width is bounded by K. The 
corresponding generating function WK{q) can be constructed by a suitable transfer matrix, 
and we quantify the obvious convergence of Wxiq) to H{q, 1,1,1). The dominant singularity 
qx of Wxiq) is estimated by truncating the infinite positive eigenvector of an infinite transfer 
matrix corresponding to H{q,1,1,1) and applying methods from Perron-Frobenius theory. 
Then the probability P(?/;(r) < K) can be extracted from Wxiq) using singularity analysis. 
Our key estimate states that the singularity qx converges exponentially to qQ, from which the 
main term of the expectation as well as the concentration property are obtained quite easily. 
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A more precise result on the distribution of the width would depend on a better understanding 
of the behaviour of qx as K ^ oo, which seems to be quite complicated. 

The proof of the theorem depends on the following definitions. Apart from the width w{T), 
we also need the “inner width” w*(T) defined to be 

w*(T) := max L^ik) 

0<k<hlT) 

for a recursive construction. Here, Lxik) denotes the number of leaves at level k. By definition, 
the inner width w*{T) does not take the leaves on the last level into account. 

For > 0, we are interested in the generating function 

WK{q)-= E 

Ter 

w{T)<K 


We represent WK{q) in terms of the generating functions 


WkA<1) ■■= E 

Ter 

w*{T)<K 

m(T)=tr 


for r > 0 so that 


lK/t\ 

WK{q) = l+ E WkAq)- 


Here, the summand 1 corresponds to the tree of order 1. For all other trees, the number m{T) 
of leaves on the last level is clearly a multiple of t. 

Next we set up a recursion for Wk,^ 1 < < A'(A'), where N{K) := \K/{t — 1)] — 1. Let 

us dehne the column vector 


^K{q) ■={WK,M,...,WK,NiK)? 


and the “transfer matrix” 

MAq) ■= {q 

where the Iversonian notatioEEl 

[expr] = 


r r + K 

< s < - 

t t 


l<r<N{K) ’ 

1<s<nIk) 


1 if expr is true, 
0 if expr is false 


popularised by Graham, Knuth and Patashnik m has been used. 
We now express Wxiq) in terms of Mxiq)- 


Lemma 5.1. For K >t, we have 

WK{q) = {I-MK{q))-^ 


A 


Vo/ 


(5.2) 


^Keep in mind that we also use square brackets for extracting coefficients: [q^]Q{q) gives the n-th coefficient 
of the power series Q. 
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Proof. As in the proof of Theorem |TTj a tree T' of height h + 1 > 2, inner width at most K 
and m{T') = rt arises from a tree T of height h, inner width at most K and m{T) = st by 
replacing r of the st leaves of T on the last level by internal vertices with t succeeding leaves 
each. We obviously have r < st. In order to ensure that w*{T') < K, we have to ensure that 
st — r < K. We rewrite these two inequalities as 


r 

- <s< 
t 


r + K 
t 


(5.3) 


If r < N{K), we have r < K/{t — 1) and therefore s < K/{t — 1) by (5.3), i.e., s < N{K). 
This justifies our choice of N{K). The construction above yields s new internal vertices in T'. 
There is only one tree T' of height < 2, namely the star of order t + 1, which has one internal 
vertex (the root). In this case, r = 1. 

Translating these considerations into the language of generating functions yields 


N(K) 

WkM) =q[r = l]+ ^ q'' 

S = 1 


r r + K 

- <s< - 

t t 


WkM)- 


Rewriting this in vector form yields (5.2). 


□ 


We will obtain asymptotic expressions for the coefficients of by singularity analysis. 
To this end, we have to find the singularities of {I — MK{q))~^ as a meromorphic function in 
q. In order to do so, we have to consider the zeros of the determinant det(/ — Mx(q)). Note 
that qx is a zero of det(/ — M^iq)) if and only if 1 is an eigenvalue of MxiqK)- In the next 
lemma, we collect a few results connecting Mxiq) with Perron-Frobenius theory. 


Lemma 5.2. Let K > t and q > 0. Then 

(1) the matrix Mp^iq) is a non-negative, irreducible, primitive matrix; 

(2) the function q i-A Amax(Afi^((?)) mapping q to the spectral radius of Mxiq) is a strictly 
increasing function from (0,oo) to (0,oo); 

(3) if MK{q)x < X or MK{q)x > x holds componentwise for some positive vector x, then 
Amax(Mif(g)) < 1 or Amax(Mx(g')) > I, respectively. 

Proof. We prove each statement separately. 

(1) The matrix M^iq) is non-negative by dehnition. We note that j < r — 1 holds for 
all r > 2 and r -|- 1 < holds for all r < N{K). This implies that all sub diagonal, 
diagonal and superdiagonal elements of Mxiq) are positive. Thus Mxiq) is irreducible. 
As all diagonal elements are positive, it is also primitive. 

(2) This is an immediate consequence of |16( Theorem 8.8.1(b)]. 

(3) Assume that MK{q)x < x for some positive x. Let > 0 be a left eigenvector of 
Mxiq) to the eigenvalue p{MK{q)). Then 

p{MK{q))y'^x = y'^MK{q)x < y'^x. 

The result follows upon division by y'^x > 0. The case MK{q)x > x is analogous. □ 


We consider the infinite matrix 


Moo{q) := 



r 

- < s 
t 


l<r 

l<s 
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and the infinite determinant det(I — Moo{q)) which is defined to be the limit of the principal 

minors det([r = s] — (?^ [^ < s])i<r<Ar when N tends to oo, cf. Eaves m- For |g| < 1, this 

i<s<Ar 

infinite determinant converges by Eaves’ sufficient condition. 

We now show that the infinite determinant is indeed the denominator of the generating 
function H{q,1,1,1). 

Lemma 5.3. We have 

det(I - Moc{q)) = 1 - b{q, 1,1, 1) 
where b{q, 1,1,1) is given in Lemma \K^ 

Proof. When expanding the infinite determinant, we take the 1 on the diagonal in almost all 
rows and some other entry in rows ai < 02 <■■■< ak for some k. These other entries have to 
come from —Mao{q). Extracting the sign for these rows yields 

det(/- Moo(g)) = ^(-1)^ ^ det(g“'[ai < taj\)i<i^j<k 

k>0 l<ai<a2<---<afe 

= E(-l)'' E det([ai < taj])i<ij<k. 

k>0 l<ai<a2<"-<o,k 

We trivially have Oj < toj for j > i, so all entries on the diagonal of ([oj < and 

above this diagonal are 1. If 02 < tai, the first and the second row of ([a, < taj])i<ij<k 
are identical, so the determinant vanishes. Therefore, we only have to consider summands 
with 02 > tai. In this case, we clearly have Oj > tai for all i > 2, i.e., the first column of 
([oj < taj])i<ij<k is (Ij 0,..., O)'^. Repeating this argument, we see that only summands with 
Oj+i > taj for 1 < j < k contribute to the determinant. For those summands, the matrix 
([oj < taj])i<ij<k equals {[j > i])i<ij<k and thus has determinant 1. 

Therefore, we obtain the representation 

det(/-Moo(g)) = E(-l)" E 9“^+-+“'=. 

With the change of variables oi =: bk and Oj+i — tOj =: b^-j for 1 < j < k, we obtain 
det(/-Moo(g)) = ^(-1)*’ ^ qbm+-+bkm 

k>0 

= n(E ''f'ja. 1,1). 


If K tends to infinity, Wxiq) tends to H{q,1,1,1), as the restriction on the width becomes 
meaningless. For our purposes, we will need a slightly stronger result: we also need convergence 
of the numerator and the denominator of Wxiq) given by ( |5.2[ ) and Cramer’s rule to the 
numerator a{q, 1, 1,1) and the denominator 1 — b{q, 1, 1,1) of H{q, 1,1, 1), respectively. We 
prove this in two steps. The first one is to prove that the numerator and the denominator of 
Wk{<i) tend to the corresponding infinite determinants. This is stated in the following lemma. 

Lemma 5.4. For |g| < 0.6, we have 

det(/ - MK{q)) = det(/ - M^) + 0{q^/^^^^). 
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The same conclusion holds when the s-th column of both I — and I — M^{q) are replaced 

by the vector {q,0,.. with K — 1 and infinitely many zeroes, respectively. The estimate still 
holds for the first derivatives with respect to q. 

Proof. The infinite determinant det(/ — Moo{q)) consists of summands 

± J] g-W = 

seS 

where tt : N —)■ N is a bijection such that there are only finitely many non-fixed points of tt 
and 5 is a finite subset of N containing all non-fixed points of vr. Note that the complement of 
S corresponds to those columns where 1 has been chosen on the diagonal in the expansion of 
the determinant. Not all (tt, S) will actually occur due to the Iversonian expression in the 
definition of Moo{q). 

For every A: G N, there is a bijection from the set 



tt: N —)• N bijective, S' C N finite such that {s G N | 7 r(s) 7 ^ s} C S 

and ^ 7r(s) = /c| 
s£S 


to the set 

j G N, ^ Xi = k with pairwise distinct Xi > 
i=l 

of compositions of k with distinct parts: the set S can be recovered as the set of summands in 
the composition, the permutation vr can be recovered from the order of the summands. 

As there are at most exp(2\/A log A) compositions of k with distinct parts by a result of 
Richmond and Knopfmacher |23], there are at most that many summands in the infinite 
determinant det(I — Moo{q)). 

The difference between det(/ — Moo{q)) and det(/ — Mxiq)) consists of those summands 
which do not choose the 1 on the diagonal in some row > N{K) or which choose an entry 
in some column s and in some row r with s > (r -|- K)lt. In the latter case, the 1 on the 
diagonal cannot be chosen in row s, so that the exponent of q in this summand is at least 
r + s > K/t. So all summands in the difference are of the form for some k > K/t. By 
the triangle inequality and the above estimates, we obtain 

|det(I^ det^I^ ^ ^ ^ ex p{2Vklogk)q^ = 

k>K/t 


|(xi, ...,Xj)G W 


The argument does not change if the s-th column of both matrices is replaced by the column 
vector (gr, 0,..., 0)^. 

Differentiating the determinant can be done term by term. The error term does not change 
as the bound is weak enough. □ 


The second step in the proof of the convergence of the numerator and the denominator of 
Wxiq) consists of the following simple lemma. 

Lemma 5.5. Let Igl < 0.6. Then the denominator det(I — Mxiq)) of Wxiq) converges 
to 1 — h{q, 1,1,1) with error The numerator det(/ — MK{q))WK{q) of Wxiq) 

converges to a{q, 1,1,1) with the same error. The same is true for the first derivatives with 
respect to q. 




30 


CLEMENS HEUBERGER, DANIEL KRENN, AND STEPHAN WAGNER 


Proof. The first statement is simply the combination of Lemmata |5.4| and |5.3[ 

As a formal power series, Wxiq) converges to H(q,1,1,1) as [q^jWxiq) = [q'^]H{q, 1,1,1) 
holds for n < {K — l)/{t — l), because a canonical tree with n internal vertices has 1 + n{t — 1) 
leaves and therefore width at most 1 + n{t — 1). 

As 1 — b{q, 1, 1,1) has no root with |g| < 1/2 by Lemma [23| Wxiq) converges to H{q, 1,1, 1) 
for |g| < 1/2. As the denominator is already known to converge to the denominator 1 — 
b{q, 1,1,1) of H{q, 1,1,1), we conclude that the numerators (which are already known to 
converge to some infinite determinant) actually have to converge to a{q, 1,1,1). 

Taking derivatives with respect to q does not change the argument by Lemma |5.4[ □ 

In order to obtain information on the roots of det{I — M^iq)) and therefore the singularities 
of Wi^(g), we approximate the Perron-Frobenius eigenvector of Mxiq) by the one of the 
infinite matrix Moo{q). The following lemma gives this eigenvector explicitly—as we will see 
in the next section, it has a natural combinatorial interpretation. 


Lemma 5.6. For r >1, we have 

q^{l- ^ [u^^]b{q,u,l,l)^ =[ul'^]b{q,u,l,l). 


(5.4) 


1=1 


In particular, if we set pr = [u'^^]b{qo, u, 1,1), then {pr)r>i is a right eigenvector of Moo{qo) to 
the eigenvalue 1, i.e., 

-^oo(l?o) ■ {Pr)r>l — {Pr)r>l- C^'^) 


Proof. Multiplying the left hand side of (5.4) with and summing over r > 1 yields 

.t 


qu 


1 — qu^ 


-Y^{quy[ui^]b{q,u,l,l) = 


qu 


r>l 

1>1 

jt<r 


1 — qu^ 


-'^[u^^]b{q,u,l,l) {qu- 


t\r 


1=1 


r=jt+l 


qu 


qu 


1 — qu^ 


qu 


1 — qu^ 


_ ^^(gnV*K*]6(g,u,l,l) 

1 qu 

(1 - qu\ 1,1)) = b{q, u, 1, 1), 


□ 


where the last equality comes from Lemma 2.2 This concludes the proof of (5.4). 

Setting O' = go in (5.4) and noting that 1 = b{qo, 1,1,1) = J2r>iPr yields (5.5). 

We now use the fact that {pr)r>i is an eigenvector of M^ciq) to derive bounds for its entries. 

Proposition 5.7. All constants pr, r > 1, are positive, and we have pr = ll{ql/r) and 
Pr = 0{r‘^ql), where 

q* = qQ 


Proof. As we will see later in the proof of Theorem VI equation (6.2), the pr are limits of 


probabilities and therefore a priori non-negative. In fact, this is a consequence of Lemma 2.4 


Moreover, they sum to 1 as mentioned earlier, and in view of the eigenvalue equation and the 
fact that Moo(g) is an irreducible matrix, we even know that they must be strictly positive. 
By the eigenvalue equation (5.5), we have 

Pr > qoP\r/f] 
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for all r > 1. Iterating this yields, with Pmin = iiiiiis<iPs) 


Pr > qo 


E LJog 
1=0 


[lost C-l 


[r/Cl 


’’J'l > dmin^o 


E Liog 
1=0 


[log( r-J-l 


(1+r/d) 




As (70 > 1/i by Lemma 2.3, we have log^go > — 1 and the lower bound follows. 

To prove the upper bound, we proceed in two steps. In a first step, we note that the 
eigenvalue equation (5.5) together with the fact that J2r>iPr = 1 yields the weaker upper 
bound 

Pr = ql Ps < QoYp^ = 9o- 

s>\r/t\ 

In a second step, we use induction on r and assume that ps < cs‘^q^ for s < r for some constant 
c depending on t. Then the eigenvalue equation ( |5.5[ ) yields 

Pr<qo Y Y +qoY‘lo< cql Y ^ 

s>\r/t\ \r/t\<s<r r<s 


-^„r(\r/tV , 2q^\r/t] , g*(l + g*) \ Jr/q , 1 „2r 

- j’- 

{r + t)‘^ 2q^{r + 1) g^,(l + q^)\ r/t 1 


1 - ® 


9o 


- q*) t(l - (1 - q*)^ ) ^* 1 

As t^(l — g*) > 1 for t>2 (cf. Lemma 2.3), we obtain 


qo 


2r 

qo ■ 


^ 2 r rjt 2 ’’(^+1 (^+)) 2 r 

Pr < cr q^qj = cr =cr q^ 


for sufficiently large r. 


□ 


Lemma 5.8. The generating function Wxiq) has a unique singularity qx with {qxl < 0.6 for 
K > Cl, where qj is a suitable positive constant depending on t. It is a simple pole and a zero 
of det(/ — MK{q))- Furthermore 

qo + <qK<qo + qiA q^''' 

for suitable positive constants qj], qj] depending on t. 

Proof. In the following, C 4 , C 5 , ... denote suitable positive constants depending on t. 

As H{q, 1,1,1) has a unique pole q with |q| < 0.6 by Lemma |2.3| and numerator and 
denominator of Wxiq) tend to the numerator and denominator of H{q, 1,1,1) respectively by 
Lemma 5.5 Wxiq) also has a unique pole with |g| < 0.6 for sufficiently large K. 

We set xk = {pi, ■ ■ ■ ,Pn(kY- If we find a q > 0 such that MK{q)xK > xk^ then Lemma 
implies that XmaxiM^iq)) > 1 and qx < q. 

We therefore consider the r-th row of MK{q)xK for some 1 < r < N{K). We have 

Pr Pr+K \ 


5.2 


{MK{q)xK)r = g’' Y Ps>q'' Y 


Ps = q 


2:<c<Z±^ 
t -- t 


r<S< 


r+K 




r+K 
% / 


= Pr[-] (1 


_ Pr+K\ 

Prqo ) 
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by the eigenvalue equation (5.5). By Proposition 5.7 we have 


PrQo 

Therefore, we have 


Pr+K . f 


- 


Pr+K 

PrQo 


qlQo 


1 




> 


> 


1 _ Pr+K 

PrQo J 




This means that for q = qo + cjK'^qQ^^^ we have MK{q)xK > xk: as desired. 
The proof of the lower bound runs along the same lines. 


□ 


Proof of Theorem\^ We choose K large enough so that Wxiq) has a unique singularity qx 
with l^i^l < 0.6 and such that qx/^-^ < 0.99. By singularity analysis and Lemma 5.5 we have 

nw{T) <K)= ^ 0(0.6^/2*)) " \l + 0(0.99")) 

for K > cg- 

We now estimate 


E(^r))= ^(l-P(u;(r)<iL)). 

^■>0 

We use the abbreviation S := 

First, we consider the summands of (5.6) with < n/log^n. By Lemma 


(5.6) 


5.1 


we have 


qiA 

qo J 


> 1 + qa 


log 5 n 


> 1 + qio]- 


log n 


n 


> qTo]logn. 


We conclude that these summands of (5.6) contribute log^nT O(loglogre). Similar estimates 
imply that 


{w{T) - log 5 n<-a log^ log n) = O 


(5.7) 


for (T > 1. 

Now, we consider the summands of (5.6) with n/log^n < < nlog^n. These are 

O(loglogn) summands with each trivially contributing at most 1, so the total contribution is 
O(loglogn). 

Next, we consider the summands of (5.6) with nlog^n < We now have 

1 


QK . ^ . log n 

— < 1 + qn-A^ < 1 + qn-, 

qo nlogn 


and therefore 


F{w{T) < K) > {1 + 0{n I^°gs0-6|/(2i)^^ g^p f—(^n -g i) log > 1 — —• 

V \qo J J log n 

The total contribution of these summands is therefore 0(1). Similar estimates imply that 


{w{T) - logs logs log n) = 0 


(5.8) 


for (T > 2. 
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Next, we consider the summands of (5.6) with < 5*^ 

,2 


This time, we have 


<10 


< 1 


n 

Cl3^ 


and therefore 


{w{T) < K) = {1 + 0{n exp (n + 1) log 


> 1 — Ci4 —. 

n 


The total contribution of these summands is therefore 0(1). 

Finally, we note that all summands with K > tn vanish: any tree with n internal nodes has 
at most width tn. 

Collecting all terms, we obtain 

log n 


E{w{T)) = logg n + 0(log log n) = 


-{t- 1) log go 


+ O (log log n). 


Combining (5.7) and ( |5.8[ ) immediately yields the concentration property (5.11. 
6. The Number of Leaves on the Last Level 


□ 


Analysing the parameter m(T) counting the number of leaves of maximum depth (labelled 
by the variable u in the generating function H{q,u,v,w)) is the topic of this section. Here, 
T is a canonical forest in for some number of roots r. We note that for fixed |rt| < 1, 
the dominant simple pole go of H{q, 1,1,1) is also the dominant singularity of H{q,u, 1,1) 
and is still a simple pole. Therefore, m{T) tends to a discrete limiting distribution; we refer 
to Section IX.2 of the book of Flajolet and Sedgewick m- Note that the number m{T) is 
divisible by t unless T has height 0. The result presented in this section is a very useful tool 
in proving the central limit theorem for the path length in the following section. 


The ore m VI. Let go, Q and U be as deseribed in Lemma \2^ and g* as defined in P roposi- 
tion 5.1 ■ For m > 1 such that mt e Z, we set pm = [u^^]b{qo,u, 1,1) as in Lemma 5.4 Then, 
for a randomly chosen forest T ^ Fr of size n, we have 

F{m{T) = mt)=pm + 0{Q^U^^)=Pm(l + 0(^Q^m(^^y^^ =0{U^^) ( 6 . 1 ) 

uniformly in r. 

Furthermore, we have E(m(T)) = pm + 0{Q^) and V(m(T)) = cr^ + 0{Q^) uniformly in r. 
Here, 


Fm — 


e-t 

2t+i 


t^ + - 5t + 32t3 -g 61t2 - 56t 








3 • 24*+4 2^^ 


1.3t^ , , 

H—xt:“£i9(^) 


and 


(Jm = 2t - 


t‘^ - 3t^ + 13f"^ - 3t^ - 17f 


2t+i 


2^2t-\-3 


3t^ + 59t® + 215t'^ - 89t^ - 208t'^ 

231-1-6 


2f 


for t > 4. For t G {2,3}, the values of pm and cr^ are given in 


Note that by Lemma |2.2t pm = 0 for non-integer m. 

Again, we visualize the distribution of the leaves on the last level for a given parameter set. 


see Figure 6.1 This is compared with the mean of Theorem VI 
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t 

f^m 

9 

2 

3.3008907135661046 

3.4340283494347781 

3 

5.4223250580971105 

10.9926467981432752 

4 

7.5391743055684431 

23.0048877906448059 

5 

9.6531072700455410 

39.9382006717564049 

6 

11.7525465927985450 

61.9509728363450114 

7 

13.8311837210749625 

88.8290211521323761 

8 

15.8889617566427750 

120.2125697911546141 

9 

17.9291240142580452 

155.7621950801096596 

10 

19.9558689242933884 

195.2366537978909468 


Table 5. Numerical values of the constants in mean and variance of the 
number of leaves on the last level for t G {2,3}, cf. Theorem VI See also 


Remark 3.1, For the accuracy of these numerical results see the note at the 


end of the introduction. 


0.6 


^ 0.4 


43 

cS 

43 

O 

Oh 


0.2 




xxxxxxxxxxx 

{5 10 15 20 25 SlT 

leaves on the last level 


• true values 
X Pm 
— expectation 


Figure 6.1. Distribution of the leaves on the last level for t = 2 and n = 30 
inner vertices. On the one hand, this figure shows the true distribution of all 
trees of the given size and on the other hand the result on the expectation of 


this distribution (Theorem VI with only main term of mean taken into account). 


Proof. As the variables v and w do not play any role, we write H{q,u), a{q,u) and b{q,u) 
instead of H{q, u, 1,1), a{q, u, 1,1) and b{q, u, 1,1), respectively. 

By (2.9|, we have U^~*qo/Q < 1, i.e., a{q,u) and b{q,u) are analytic for Igl < qo/Q and 
< ^/U by Theorem [lT| By (2.1) and Lemma 2.3 the meromorphic function q 1 —)■ H{q,u) for 
fixed u with |tt| < 1/U has a unique singularity in {q \ |g| < qo/Q}, namely qo, independently 
of u. 
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We use Cauchy’s formula, the residue theorem and the fact that a{q, u) does not contribute 
to the residue at q = qo to obtain 


[q^]H{q,u) = 


1 


H{q,u) 


27ri 
= — Res 


\q\ = l/2 q 

/ H{q,u) 

V q' 


,n+l 


dq 


■n+l 


,q = qo] + 


b{qo,u)u{r) 

qE 


+ 


1 


J_ <X 

2TTi J\q\=qg/Q 5 ”+^ 

H{q,u) 


dq 


27riJ\q\=qq/Q q 


n+1 


dq 


where u{r) has been defined in (2.11). 


By Lemma 2.4 the probability generating function Pn{u) of m{T) is given by 

Pn{u) = b{qo,u) + 0(Q”), 


( 6 . 2 ) 


uniformly for |u| < 1/U and uniformly in the number of roots r (it suffices to bound the 
numerator and the denominator of H{q,u) separately in order to get a uniform bound in r). 
We remark that this proves non-negativity of the constants pm, which we required in the proof 


of Proposition 5.7 


Expectation and variance follow upon differentiating 6(g0) u) with respect to u and inserting 
3 asymptotic expressi 
In order to compute 


the asymptotic expression for go- Here, we use the bounds derived in Lemma 2.6 

(m(T) = mt), we consider 


u^^mH{q,u)=Pm 


v{r) 

qo 


+ 


1 


(2TTiy 


H{q,u) 

\u\ = l/U J\q\=qo/Q 


dqdu. 


Bounding H{q, u) uniformly in r and using Lemma 2.4 and Proposition 5.7 yields (6.11, taking 
into account that 

TP 2 

^ = 2 + ^e2i(t) 

for t >30 and that U^/q^r > 1 remains true for all t > 2. □ 


7. The Path Length 


This section is devoted to the analysis of the path length. While the external path length is 
most natural in the setting of Huffman codes, it is more convenient to work with the total 
and the internal path length, respectively. As it was pointed out in the introduction, all three 
are essentially equivalent, since they are (deterministically) related by simple linear equations. 


Theorem VII. For a randomly chosen tree T gT of size n the total path length (as well as 
the internal and the external path length) is asymptotically (for n —)• ooj normally distributed. 
Its mean is asymptotically ptpin^ + 0{n) and its variance is asymptotically + 0{n'^) with 

_ t _ t ^ P - 2t ^ 2t^ + 3P-8t ^ 9P + 45t^ + 2t‘^ - 88t ^ 
htpi ~ 2^^ ~ A 2P^ ' 2pt+o ' pI+9 ' 2^7 ^22{t) 

and 


2 _ -P + 5t^ - 2t^ 
^tpi ~ ' 3 • 2*+4 


-6t^ + OP + 27t^ - UP 


+ 


3 • 22*+6 

-27P - 72P -h 237t^ -h 302t^ - 232p 


0.078P , , 

H ^4^e23(i) 


for t > 30. 


3 • 23i+9 
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We determined numerical values of these constants as in the previous sections. They are 
given in TableFigure [7T] shows the result of Theorem VII for particular values. It compares 
the obtained normality with the distribution of the total path length found by a simulation in 
SageMath. 


t 

~2 

3 

4 

5 

6 

7 

8 
9 

10 


^tpi 

0.5517980333242771 

0.7995328756339714 

1.0432261612615134 

1.2844112381086093 

1.5254850246188775 

1.7678665778255347 

2.0120005013102160 

2.2577888724045994 

2.5049161393093200 


^tpi _ 

0.4254704960029117 

0.7922629722714524 

1.3151643425139087 

2.0034857832310170 

2.8759607924909180 

3.9388990633171834 

5.1894943655172528 

6.6208696968269586 

8.2258587463722461 


Table 6 . Values of the constants in mean and variance of the total path length 
for small values of t, cf. Theorem VII See also Remark 3.1 For the accuracy 


of these numerical results see the note at the end of the introduction. 


We first use a generating functions approach to determine the asymptotic behaviour of the 
mean and variance. Let us define 

Lr{q,u,w) := ^ i{Ty 
Ter 

for the r-th moment of the total path length. Note that 


Lo{q, u, w) = H{q, u, 1, tc) = ao{q, u, w) + b{q, u, w)- 


ao{q, 


1 - b{q, l,w) 

in the notation of Theorem [n| but writing oq instead of a and leaving out the parameter v. 

We are specifically interested in Li and ^ 2 - In analogy to the approach we used to determine 
a formula for H{q,u,v,w) in the proof of Theorem [n} we obtain a functional equation for 
Lr{q,u,'w) by first introducing 

Lr,h{Q,u) = [wyLr{q,u,w)= Y. 

Ter 

h{T)=h 

Define, for the sake of convenience, the linear operators and = q^ 

acting on our generating functions. We get the following result for the generating function of 
the first moment. 


Lemma 7.1. We have 


Li(q, u, w) = ai{q, u, w) + b{q, u, w) 


ai{qA,w) 

1 - b{q,l,wy 




ai{q,u,w) = Yi-'^ywy<^u^wLo){q,q^^K^\w) ^ 

j=o i=i 9 a 


with 













CANONICAL TREES, COMPACT PREFIX-FREE CODES AND SUMS OF UNIT FRACTIONS 


37 


• 10 “^ 




total path length •10'^ 


Figure 7.1. Distribution of the total path length for t = 2, and n = 30 (top 
figure) and n = 200 (bottom figure) inner vertices. On the one hand, this figure 
shows the true distribution of all trees of the given size and on the other hand 
the result on the asymptotic normal distribution (Theorem VII with only main 
terms of mean and variance taken into account). In order to take into account 
that the total path length is always even, we rescale the limit distribution. 


Proof. Replacing j leaves of depth h by internal vertices, thus creating jt new leaves of depth 
/i + 1, increases the total path length by jt{h + 1). Thus we get 

m(T) 

L,,h+i{q,u)= E + E (/i + l)m(T)(7-(^)u-(^) 

Per i=i Ter 

h{T)=h h{T)=h+l 

= -7 (Ti,h(g, 1) - Ti,fc(g, qu^)) + {h + l)u—LQ^h+i{q, u) 

1 — qvr ou 
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and = 0- Then, by multiplying by and summing over all h, we obtain 

Li{q,u,w) = - - j [Liiq, l,w) - Li{q,qu\w)) + ^u^wLo{q,u,w). 

1 — qu^ V / 


Lemma 2.1 yields the desired formula for Li{q,u,w). 

Next, we derive a formula for the generating function of the second moment. 
Lemma 7.2. We have 

a2{q, l,rc) 


□ 


L 2 {q, u, w) = a 2 {q, u, w) + b{q, u, w) 


1 - b{q, l,rc) 


with 


a2{q,u,w) = (2{<^u^^Li){q,q^^^u^\w) - (^>^d>^Lo)(g, ,i/;)) fj 

.7=0 i=l 


1 qMyt^ 


1 — gWti*' 

Proof. As in Lemma |7.1[ we derive a functional equation for L 2 (g, u, w). Starting with a tree 
T of height h and creating jt new leaves of depth h + 1 changes the square of the total path 
length from £(T)^ to {i(T) + jt{h + 1))^. This translates to 

m{T) 

i2A+i(9.«)= E E + E (A + i)Mr) 


TeT j=i 

h(T)=h 


TeT 

h{T)=h+l 


+ 2 ^ (/i + l)(£(r)-m(r)(/i + l))m(r)g”('^)u™('^) 


TeT 

h{T)=h+l 


qu 


(T2,h(g, 1) - T2,/7(g,gtt‘)) 


1 — qu*- 

+ 2{h + l)^uLi,h+i{Q, u) — {h + l)^<i>^Lo,/i+i(g, u). 

Note that we have L 2 fl{q,u) = 0. Encoding the height by leads to the functional equation 
for the generating function 


L2{q,u,w) = 


qvfw 
1 — qu*- 


(L 2 (g, 1, re) - L 2 (g, gn% w)) + 2^>„d>^Li(g, u, w) - ^>„d>^Lo(g, u, w). 


Again, Lemma 2.1 finishes this proof. 


□ 


In order to determine the asymptotic behaviour of mean and variance, one only needs to 
find the expansion around the dominating singularity go and apply singularity analysis. The 
main term of the mean is easy to guess: assuming that the vertices are essentially uniformly 
distributed along the entire height, it is natural to conjecture that i{T) is typically around 
tn{T)h{T)/2 and thus of quadratic order. This is indeed true, and the variance turns out 
to be of cubic order (terms of degree 4 cancel, as one would expect). The following lemma 
substantiates these claims for the mean. 

Proposition 7.3. The mean of the total path length is Htpin^ + 0{n) with 


Ttpi — 2 /^^' 
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Proof. By substituting Lq into the functional equation of Lemma 7.1 
expression for Li{q, 1 ,- 10 ), namely 


we get an explicit 


Li{q, 1,-0]) = 




^ gW 


Li 1 — 


+ 


ao{q, l,w) 

{l-b{q,l,w))^ 




X gPl 


J-J- 1 - qlil 
1=1 ^ 


(<l’toOo)(g, 1, u;) ■ ■ I’I gl*ll 

+ 7 :^-TT—^-rwT 2_^[-l)^w^{<^ub)[q,q^^\w) 


(1 -6(g,l,u;))2^^ 


+ 


1 — gW 

gP 




The dominant term in this sum is the first one, with a triple pole at the dominant singularity go- 
The second and third term, however, are also relevant in the calculation of the variance, where 
one further term in the asymptotic expansion is needed in view of the inevitable cancellation in 
the main term. Singularity analysis immediately yields the asymptotic behaviour of the mean: 
since the pole is of cubic order, the order of the mean is quadratic, i.e., it is asymptotically 
equal to where the constant ^tpi is given by 


3 

% 


- 2(i.,6)(,„, 1 ,1)2 j _ 1.1 ■ (7-1) 


Plugging in the definition of 6 as a sum, it is possible to simplify this further. One has 


k [[/7II \ k 

('f«M,.».i) = E{-T-‘(nTyyr i; 


k=l 


1 — J 1 — 


by logarithmic differentiation and thus 




{‘^ub){q,q^ ,1) = (n i_qlh}+nj}) ^ I _ gW+t'*L1 

k=l ^h=l y ^ h=l ^ 




k=l 


^^l-qblj ^^1-q 


E 


t 


h 




since [[/i]] + = \h + jj by definition. Plugging this into (7.11, we find 


l^tpl — 


(d>^6)(go, 1,1) 

2(^>q6)(go, 1,1)2 




■i+fc „w 

% 


k 

E 


-- 1 J ^ 1 Jh+j} ' 

i=i J- — go ii=i ^ ~ % 
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Substituting i = j + k and interchanging the order of summation, we arrive at 


1) 1) 

{^wb){qo, 1 , 1 ) 

_ (^«;^)(90; 1; 1) f TT gp ^ ^ 

2{<!>qb){qo, 1,1)2 1 ql 




PI s £ fc 


i=l ^ — Qo 2 k=l h=l ^ — Qo 


t 


h 


Ih+e-kj 




PI \ ^ r 


1 /^Pl } --- - -- 1 

=1 t - 9o 2 r=l h=l t - 9o 






Noting now that 




e=i 


l-i 1 _ Qpl ) 
=1 ^ y / 


1 - gW ’ 


which can be seen by another logarithmic differentiation, we can replace the sum in the 
expression for fitpi above by t ■ {^qb){qo, 1,1), which finally yields 


_ t {^wb){qo, 1,1) 


and the second fraction is precisely ^h, cf. Equation (3.1). 


□ 


Our next goal is to obtain the asymptotics of the variance, which will again follow by 
applying the tools from singularity analysis together with the result for the mean shown above. 
Let us use the abbreviation 


1=0 




7 PI 


1 - gW 


where M is a function in the variable j and an operator, to simplify the expressions in the 
following lemma. 
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Lemma 7.4. The variance of the total path length is cr'^pin^ + O(n^), where 


2 _ (^0,1,1) (^0, 1,1) , , , i n2 

'^tpl 


{^qh) (go, 1,1)" 


S(go,j ^ 1,^>«)" 


(c^,^.b)(go,l,l) (c^.6)(go,l,l) ^ ^ , 

{%b){qoA,lt 

^ (S(go,j ^ 1,^^,^^.) + S(go, j ^ 

(4>g5) (go,l,l) V 


+ ^( j ^ ^2 


- 1 _ 

i=l 1 Qo 


PI 




(4>^6)(go,l,l)^ 

3(cI>,6)(go, 1,1)3 

(4>g,6)(go,l,l) 

3(cI>,6)(go, 1,1)3 

(^>«,^)(go,l,l) 


2[[j + Ij — 1, ^2} + 9o, j 2t^ 


1 ^IL' 

2=1 ^ Qo 


PI ’ 


S(go, j 1,^«)' 


+ 

+ 

+ 


Proof. In order to calculate the variance, one needs, besides the result of Proposition |7.3[ the 
asymptotic behaviour of L 2 {q, 1,1) at the dominant singularity. Only the terms of pole order 
4 and 5 (i.e., highest and second-highest) are needed. More details on the computation can be 
found in the appendix. By Lemma |7.2| we obtain 


3(4>'^5)(go’l’l)3 ^ l,^«)(s(9o,j ^ l,^>t.^«,) + S(go,j j,<^u))- 


-^2(g,l,l) = 


6ao(g, 1, l)(4>^„5)(g, 1,1)^ 


-P 

-P 

-P 

-P 

-P 


(l-6(g,l,l))5 
4ao(g, 1, l)(4>^b)(g, 1,1)^ 

(1 - KQ: 1,1))"^ 

8ao(g, 1, l)(4>^b)(g, 1,1) 

(1 -6(g, 1,1))4 
6(4>^ao)(g, 1, l)(4>^6)(g, 1,1) 

(1 - K?, 1,1))^ 

2ao(g, 1, l)(4>g,b)(g, 1,1) 

(1 -6(g, 1,1))4 

2ao(g, 1, l)(4>^„6)(g, 1,1) 

(1 - Kt 1,1))"^ 

2ao(g, 1, l)(4>^6)(g, 1,1)^ 


S(g,i >-)■ 1,4>„)" 


^(9,7 IJ + Ij, ‘^’n) + ^ f X! 1 _ [j] 

, ^ i=l ^ ^ 


,^u 


S(g, j >-)■ 1,4>„) S(g, j H> 1,4>„4>^) 


S(g, j 1 -^ l,4>n)' 


+ 0 


(1 - Kq ^ 1,1))"^ 

1 

(1 - K^,i,i))^ 


S(g, j >-)■ 1,4>„)^ 

S(g,j eA l,4>„)S(g,j i-^i,4>«) 
S(g,j hA 


as q tends to go- 



















42 


CLEMENS HEUBERGER, DANIEL KRENN, AND STEPHAN WAGNER 


Applying singularity analysis to the highest- and second-highest order terms of both Li 
and L 2 yields the variance. The terms of order cancel (as one would expect), and one finds 
that the main term of the variance is asymptotically a 


tpl 


n^. 


□ 


In order to obtain expressions (either the asymptotics in t or the values for particular 


given t) of ^tpi and we insert the dominant singularity qo (see Lemma 2.3) into the 


formulas obtained in Proposition 7.3 and Lemma 7.4 We remind the reader again that it is 


important to establish that a'^^i 7 ^ 0 , so numerical values and estimates for large t are needed 
again. A couple of technical difficulties arise due to the infinite sums. These are discussed in 
the following remark. 


Remark 7.5. We use the SageMath 


order to get the asymptotic expression and values for 


mathematics software system for o ur calculations. In 

(note that we have 


VII 


in Theorem 

already due to Proposition 7.3 and the results of Section |^, we have to evaluate infinite 
sums and insert the dominant singularity go- 
We will explain step by step how this is done. 

(a) We start with the expression for found in Lemma 


7.4 


(b) First, let us consider the infinite sums S(g 0 ) For a suitable Js depending on t, we 

calculate the first Js summands directly and use a bound for the tails. More precisely, we 


use 

00 

E( 


-ly M{j)m{qo,q}^\l) (n 


J*1 

% 




=1 1 - qi;' 


Jt. 

i=l ^ 9o 


4‘' 




00 

E 

j=js 


MjQ^ 




with the interval I = [—1,1], M{j) < Mj for j > Js and Q = 

Let us consider the bound Mj. If M(j) = [j]], we set Mj = /{t — 1) and analogously 

for M{j) = Ij -I- Ij. If M{j) = J2i=i ^W/(l ■'^0 use Mj = /{I — qo). Otherwise 

{M{j) = 1 and M{j) = j), we simply take Mj = M{j). These choices allow us to hnd a 
closed form for MjQ^~-^^. 

Proceeding as described above gives an expression consisting of finitely many summands 
containing functions b, which will be handled in the following step. 

(c) Let us deal with the function b{q, u, w) and its derivatives, which all are infinite sums. As 
above, we calculate the first Jb summands directly for a suitable Jb chosen depending on t. 


Then we add the bound provided by Lemmata 2.5 and 2.6 to take care of the tails. 


(d) 


At this point, we end up with a symbolic expression not containing any (visible or 
hidden) inhnite sums; only the variables t, qo, U and the interval I occur. Thus, we are 
almost ready to insert the asymptotic expressions or values for these parameters. 

Now, we are ready to insert the dominant singularity go- Oll the one hand, this can be 
the asymptotic expansion of go as t 00 (in our case valid for t > 30), cf. Lemma 2.3 


We choose Jj: = Jb = 5. The result will then again be an asymptotic expression for cr^^. 

On the other hand, we can use a particular value for go for given t (which for us means, 
more precisely, an interval containing go). In these cases, we choose Js = Jb = 4 for 
8 < t < 30 and higher values for t < 8 (up to Js = Jb = 14 for t = 2). The resulting 
is then computed using interval arithmetic. 













CANONICAL TREES, COMPACT PREFIX-FREE CODES AND SUMS OF UNIT FRACTIONS 


43 


In order to prove asymptotic normality of the total path length, a different, more probabilistic 
approach is needed. Standard theorems from analytic combinatorics no longer apply since the 
path length grows faster than, for example, the height, so that mean and variance no longer 
have linear order. 

We number the internal vertices of a random canonical t-ary tree of size n from 1 to 
n in a natural top-to-bottom, left-to-right way, starting at the root. Let denote the 
depth of the fe-th internal vertex in a random tree T G T of order n. Moreover, set 
Yk^n = Xk+i,n — Xk^n G {0,1}. In words, Yk^n is 1 if the {k + l)-th internal vertex has greater 
distance from the root than the fe-th, and 0 otherwise. It is clear that the height can be 
expressed as 

n—1 

h(T) = 1 + maxXk^n = 1 + ^n,n = 1 + X! ^k,n, 

k=l 

which would indeed be an alternative approach to the central limit theorem for the height. 
More importantly, though, the internal path length can also be expressed in terms of the 
random variables lfc,n) namely by 


n n k—1 n—1 

^internal(T') — 

k=lj=l j=l 


k=l 


Now 


n—1 

R' ^internaliY) — ^ ^ 

i=i 


n-j 


n 


Y, 




can be seen as a sum of n — 1 bounded random variables Zj^n = An advantage of this 

decomposition over other possible decompositions (e.g., by counting the number of vertices at 
different depths) is that the number of variables is not random. Another important point is 
that the Zj^n are bounded after rescaling, so that they also have bounded moments. 

Unfortunately, the Zj^n are neither identically distributed (which is not a major issue) nor 
independent, which makes standard versions of the Central Limit Theorem for sums of random 
variables inapplicable. However, they are almost independent in that they satisfy a so-called 
“strong mixing condition” (Inequality (7.2) of the following lemma). 


Lemma 7.6. Let J-gi be the a-algebra induced by the random variables Zi^n, -2^2,nj • • • ) Zg-^^n, 
and let Qg^ he the a-algebra induced by the random variables Zg^^n, Zg^+i^n, ■ ■ ■, Zn-i,n- There 
exist constants n and A (depending only on t) such that 

|P(A n H) - F{A)F{B) I < (7.2) 

for all 1 < Si < S2 < n and all events A G Xg^ and B G Gg^- 


The main idea of the proof of the strong mixing condition is simple: events A G describe 
the shape of the random tree T up to the si-th internal vertex while events B G Gg 2 
describe the shape of the random tree T from the S 2 -th internal vertex Vg^ on. The probabilities 


of such events can be calculated by means of Lemma |2.4| and Theorem VI, and the exponential 
error terms that one obtains through this approach yield the estimate (7.2) above. 


Proof of Lemma 7.6, For a canonical tree T, let F\{T) and Fp{T) be the number of internal 


vertices on the same level as u<j^, left and right of Vg^, respectively. Similarly, let G\{T) 
and Gp{T) be the number of internal vertices on the same level as Vg^, left and right of Vg^, 
respectively. For fixed si, /a, fp, S 2 , g\ and gp, there is a bijection between the following: 
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rs = 6 , 5 a = 2, = 3 


Figure 7.2. Decomposition of canonical trees. This decomposition into Ti, 


T 2 and T 3 is used in the proof of Lemma 7.6 


j 

D 

rij 

1 

1 

Si - 1 - /a 

2 

/a + 1 + /p 

S2 - 1 - 5a - (si - 1 - /a) 

3 

1 5a + 1 + 5p 1 

n-{s2-l-gx) 


Table 7. Values of rj and rij for the decomposition of a random tree. 


• the set of canonical trees T with F\{T) = /a, Fp{T) = /p, G\{T) = g\, Gp{T) = Qp 
and such that and Vs 2 are on different levels, and 

• the set of tuples (Ti,T 2 ,T 3 ) where Tj is a canonical forest with Vj roots, rij internal 
vertices and rrijt leaves at the last level, where the values of Vj and Uj are given in 
Table Tj has no isolated root^and rrijt > rj+i holds for j G {1, 2}. 


An illustration can be found in Figure 7.2 

Here, Ti consists of the first levels of T up to and including the level of Vg ^, T 2 consists of 
the levels of T from and including the level of up to and including the level of Vg ^, and T 3 
consists of the levels of T from and including the level of Vg^- Note that the internal vertices 
of T are partitioned into those of Ti, T 2 and r 3 as the last level of a forest does not have any 
internal vertices by definition. 

Note that the definition of a canonical forest does allow an arbitrary number of isolated 
roots; by definition, those are leaves and not internal vertices. In order to use Lemma 2.4 and 


Theorem |VI| for our cases, we use the simple bijection between forests with n internal vertices 
and r roots all of which are non-isolated and forests with n — r internal vertices and rt roots 
realised by omitting all r roots. 

With (5=1 + (log2)/(2t) + 0.06/t^ (Lemma 2.3), g* = (Proposition 

(7 = 1 — (log2)/t^ (Theorem VI), we fix 0 < <5 < 1/4 such that 


5.7) and 



(7.3) 


4 


We define an isolated root to be a root without children. 
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holds for all j > 1. We first compute the probability to have at least mit > S(s 2 — si) vertices 
at the level of ■ To do so, we use the decomposition as described above with the following 
modification: we do not use the full decomposition into {Ti,T 2 ,T 3 ), but join the latter two to 
have a decomposition (Ti,T 23 ) in the obvious way. By Lemma 2.4 and Theorem VI we have 


= 0(^0 

canonical trees Ti with mit leaves, and there are 

p(tr 2 )go = O(go 

canonical forests T 23 . Note that we used v{tr 2 ) = 0(1) (see Lemma [Td] ) and < 1. Therefore, 
using U < 1, we find the desired probability to be 

= E T,o{u""‘) 

mi>S{s2-si)/t fx 
mi>S{s2 — si)/t 

= 0{6{s2 - = 0((S2 - 

Analogously, the probability that there are at least ( 5 (s 2 — si) vertices at the level of Vs 2 
is also 0 ((s 2 — In particular, the probability that Vg^ and Vs 2 are on the same 

level is bounded by 0 [{s2 — From now on, we consider the event W that v, 


Si 


and Vs 2 are on different levels and that there at most 6 {s 2 — si) vertices at each of the levels 


of Usj and Vg ^, respectively. The previous discussion shows that 


F{W) > 1 - 0((s2 - 


(7.4) 


Let now two events A G 
in the cr-algebra generated by Zs 2 ^n, -^ 62+1 


in the fi-algebra generated by Zi^n, ■ ■ ■, Zg^^n and B G Gg2 


..., Zn-i^n be given. The event A consists of a 


collection of possible shapes of the random tree T up to the si-th vertex Vg -^, and likewise 
B consists of a collection of possible shapes of the random tree T from the S 2 -th vertex Vg 2 
onwards. For ease of presentation, we assume that the events A and B consist of only one such 
shape up to Si and from S 2 on, respectively; the general case follows upon summation over all 
shapes in A and B. The shapes A and B uniquely determine Fx{T) =: fx and Gp{T) =: gp, 
respectively. On the other hand, Fp{T) and Gx{T) will be somewhat restricted by the shapes 
in A and B, respectively. 


Using Lemma |2.4| Theorem VI and the bijection into a tree and forests described above 
yields the following estimates for the probabilities we are interested in. There, the error term 
0 {Q'^) in the denominator will always be absorbed by the error term in the numerator because 
’\Ye obtain 


fp 
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using the inequalities Q < 1, r 2 < (5(s2 — si) (since we are in the situation that event W 
occurs) and 122 + ns > S 2 — si. We also get 


p(Bnw) = 


p(l)go "(1 + 0(Q-)) 5 


- 711-712 


^ Pm2 I 1 “1“ O I Q 


711+712 


m 2 


jjt\ 7712 ' 

q* 


9 a r^/t<m.2<S{s2—si)/t 


by (7.3) with j = S 2 — si and the inequalities gx < 6 {s 2 — si) (again because W occurs) and 
+ R 2 > ^-2 > (1 — <5)(s 2 — si). Similarly, we calculate the probability that all three events 
A, B and W occur simultaneously as 


P{Ar^Br^W) = 


1 


•'(1)* ”(i + 0((3")) 1;^ 


—n2+7'2 


X Pm2 I 1 + O ( Q 


,712 —7’2 


m 2 


jjt\ 7712' 

q* 


= Y. 


T. + 0(Q(7/‘-“)<->->)), 


/p>9A r3/t<m2<S{s2-si)/t 

where we additionally used r 2 < S(s 2 — si). We conclude that 


^^(^^ 2 ) m+Tig+ra 
p(l) ^0 


|P(i4n w)P(Bnw) -P(AnBnw)l < E E 

/ pi 9 a r3/t<m<5{s2-si) 

X 0(Q(3/4-25)(.2-^i))) 

= n b n w). 

Combining this with (7.4| yields the strong mixing property (I^. 


□ 


Now we are able to apply the following result of Sunklodas. 

Lemma 7.7 (Sunklodas [55]). Let d, s G (2,3], k, X, cq be fixed positive constants. Then 
there exists a constant K such that for all positive integers n and random variables Xi, X 2 , 
..Xn the following holds: 

If 

(1) E{Xj) = 0 for all j, 

(2) maxi<j<iiE(|Xj|®) < d, 

(3) the strong mixing condition 

sup |P(i4 nB)- P{A)P{B)\ < Ke-^^ 

l<t<n—T 

holds for all r (where and Qt+r are the a-algebras generated by Xi, ..., Xt and by 
Xt+T, ..., Xn, respectively) and 
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f4j the inequality 


holds, 
then we have 


^n = V >con 

S=i 


sup 




Y,X,<x)--S>{ 


i=i 
, 2 / 


< 


K{\og{Bn/^o)) 


s-1 


B, 


s-2 


where ^(x) = (27r) e du denotes the distribution function of a standard normal 

distribution. 


Remark 7.8. Actually, Sunklodas gives a stronger statement where A is not necessarily constant, 
but we will only need this version. Moreover, he technically considers an infinite sequence 
Xi,X 2 ,... of random variables and assumes that the conditions above hold for all n. However, 
the statement gives an explicit inequality for each fixed n, and the proof of this inequality 
given in [2S] only makes use of the conditions for the same fixed n. This is important for us, 
since we are not considering an infinite sequence, but rather a finite sequence of n random 
variables that all depend on n. 


Proof of Theorem \ VI1\ The qualitative behavior of the asymptotics of mean and variance 
follows from the moment generating functions Li and L 2 (see Lemmata 7.1 and 7.2) by using 
the standard tools from singularity analysis as explained earlier. We get the constants fitpi 
and atpi from Proposition 7.3 and Lemma 7.4 respectively, by inserting either the asymptotic 
expansion of qq, cf. Lemma 2.3 or the values of qq for given t (see also Remark 7.5). 

Asymptotic normality follows from Sunklodas’s result (Lemma 7.71 applied to the sequence 
Xj = Zj^n — — ^(Yj^n)), where Yj^n, Zj^n are defined as explained earlier in 

this section. Since \Xj\ is bounded by 1, the hrst condition of Lemma 7.7 is trivially satished 
(for any s). The second condition (strong mixing property) is exactly Lemma 7.6 and hnally 
we already know that the variance of the sum n~^ Zlj, which is equal to the variance of 
hnternai{T), is of linear order, because the variance of imtemai is of cubic order. 

Since the upper bound for A„ in Lemma |7.7| goes to 0 as n —)■ 00 , it follows that the 
distribution of imtemai (suitably renormalised) converges weakly to a Gaussian distribution. 
We can even conclude that the speed of convergence is 0(n“^/^ logn). □ 
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Appendix A. Details on the Variance of the Total Path Length 

In this appendix, more details of the proof of Lemma |7.4| are given. Some of these calculations 
were performed with computer assistance using SageMath j24] . 

For any variable z, we write = z^. Such an operator satisfies the following properties. 

Lemma A.l. For any expressions a, b and any variable z, we have 

+ b) = 4>^(a) + 4>2(6), ^> 2 ( 06 ) = ^>^(o )6 + a‘^^{b). 

Moreover, for a function f we have 


^zf = ^z[z^]=Z—+Z^ = <^J + Z^. 


dz 


dz 


dz"^ 


dz'^ 


If (zi,, Zk) eA f{zi ,..., Zk) is a k-ary function and ai, ..., Ok are expressions, we have 

^ df 

^z{f{ai,.. .,ak)) = ^(ai,.. .,ak)^z{aj). 

j=i j 

In view of the functions occurring in Section we have the following, more specific 
properties. 

Lemma A.2. Let {q,u,w) e->■ f{q,u,w) be a function and j be a non-negative integer. We 
have 


^u{f{q,q^^^u*'\w)) = ^{q,q^^K*\w)Fq^^K*' = F {^uf){q,q^^^u*\w), 
^w{f{q,q^^^u’'\w)) = {^^f){q,q^^K^\w). 

We also need derivatives of the products appearing throughout this article. 
Lemma A.3. Let 

{ 9 , «) = ![' 


W ...I . 1 .. M.fi 




J-l- 1 — giWti** 
2=1 ^ 


Then we have 


with 


<^uPj{q,u) = pj{q,u)Pj{q,u), 
<^qPj{q,u) = rj{q,u)Pj{q,u), 


Pjiq,u) = Y 


^ 1 — gWu** 


iiq,u) = Y 


“ 1 — gWu** 

1=1 ^ 

Proof. These results follow since {^zf) = f{z)/{l — z) for f{z) = z/(l — z). 


□ 
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Next, we consider the infinite sum 

S{q,u,w,MJ) := g, w,'u;)/(g, , R;)Pj(g, u), 

j>o 

where the function M depends on j, q, u and w, and the function f on q, u and w. Note that 
this notion is slightly more general than S(g, M, <h) of Section]^ The relationship between 
these two is 

S(g, M, ^>) = S{q, u, w, M, , 

where b is defined in Theorem HIl of this article. 

Taking derivatives yields the following results. 

Lemma A. 4. We have 


^qS{q,u,w,MJ) 

^uS{q,u,w,MJ) 

^ujS{q,u,w,MJ) 


S{q, u, w, 4>gM, /) + S{q, u, w, M, 4>g/) + S{q, u, w, M[[j]], 4>„/) 

+ S{q,u,w,Mrj{q,u),f), 

S{q, u, w, f) + S{q, u, w, MP,<^uf) + S{q, u, w, Mpj{q, u),f), 
S{q, u, w, Mj, f) + S{q, u, w, 4>^„M, /) + S{q, u, w, M, 4>^/), 


where M g{j, q, u, w) is short for {j, q, u, w) i—)■ M{j, q, u, w) g{j, q, u, w). 


We are now on our way to derive an expression for L 2 {q, 1,1) suitable for doing singularity 
analysis (cf. the proof of Lemma 7.4). As a first step, using the properties above we obtain 
the following expression for L 2 {q,l,l) (only leading terms): 


6 (4>^a) (g, 1,1) (g, 1,1) S (g, 1,1,1, (4>„fe) (g, 1,1))^ 

(1 - 6(g,l,l))4 

8 i<^wb) (g, 1,1) S (g, 1,1,1, (4>„4>^6) (g, 1,1)) S (g, 1,1,1, (4>„6) (g, 1,1)) a (g, 1,1) 

(1 - b{q, 1,1))4 

6 {<^wb) (g, 1,1)^ S {q, 1,1,1, (4>^b) (g, 1,1))^ a {q, 1,1) 

(1 -b{q,l,l))^ 

2 {•^‘jb) (g, 1,1) S (g, 1,1,1, (4>»6) (g, 1,1))^ a (g, 1,1) 

(1 - b{q, 1,1))4 

_ 2 {<^^b) (g, 1,1)^ S {q, 1,1,1, (4>^fe) (g, 1,1)) a {q, 1,1) 

(1 - 6(g, 

(g, (g, 1,1,1,5* (g, 1, l,P,{<^lb) (g, 1,1))) a (g, 1,1) 

{l-b{q,l,l)y 

4 i^wb) (g, 1,1)^ S {q, 1,1,1, S {q, 1, l,Pj{q, 1), (4>^b) {q, 1,1))) a {q, 1,1) 

(1 -6(g, 1,1))4 

2 {<^>wb) (g, 1,1) S (g, 1,1,1, (4>„6) (g, 1,1)) S (g, 1, l,j, (g, 1,1)) a (g, 1,1) 

(1-6(g,l,l))4 

For readability, we have not written the e-)■ formally needed in the formula above; for example, 
the S'-function in the first summand should read as 


S {q, 1,1, (j, q, u, w) HA 1, (g, u, w) i-A (4>„6) (g, 1,1)) 

Compared to the formula found in the proof of Lemma |7.4[ nested S'-functions appear. As a 
next step, we simplify these nested 5-functions by means of the following lemma. 
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Lemma A. 5. We have 

S hA S {q,u,l,{j,q,u,w) ^ pj{q,u),f)) 


and 


= sU,1A,U,Q,u,w) ^ t ^ ^ I.j , / 

' i=l ^ 


s (q,lAA,{q,u,w) hA S (q,u,l,{j,q,u,w) ^ , f 


= s(^q,lA,U,q,u,w) hA Ij + 11, 


Proof. We have 

5'(g,l,l,l,(g,u,u;) HA S {q,u,l,{j,q,u,w) ha M {j,q,u,w), f)) 

= ^ M{j,q,u,w),f)Pk{q,l) 


k>0 

k>0 j>0 

= E U 0 /A,|^-U) n n 

j,k>0 i=l ^ 2=1 ^ ^ 

= E (-i)'=«M(i,,,,w,uo/A,t+Ai) n ri rS 

J,/c>0 2 = 1 ^ 2 = 1 ^ 

j,/c>0 


With the substitution i = j + k, this equals 


q^^^,^)Pe{q, i) q,qV ■?!,«;) 

j=0 


£>0 


= Siq,l,l,{j,q,u,w) HA ^Kw),f 

^ k=0 

We now compute the inner sums occurring in the simplified expressions for the nested S- 
functions. The second one is simply J2k=o^^ ~ I-l + ^1- 

j k 


^ Pk{q, q^^ ^ ^ ^ _ iq ij-kjp H 

k=0 k=0i=l y y l<i<k<j 

With the substitution i + j — k = i, this equals 

P ^ f 


f 


, 1 _ qli+j-kj ' 


E 


. 1 ^ 


l<2<2+ji'—^<J 

The result now follows. 


1 — gM 


“El M “ E 1 _ M E “ E 




1 - q^'^^ ^ 1 £^i^~ ' 


□ 
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We continue to rewrite Using the previous lemma, we have 


^ 2 ( 9 ,1,1) = 


+ 


(1 - 1,1))5 (1 - 6(g, 1,1))4 


+ 0((l-6(g,l,l))-^) 


for suitable V^{q) and Vi{q). Using the fact that 5(go, 1,1) = 1 and the expression for <l>q/ of 
Lemma |A.1[ we get 


1 - 1,1) = 1 - (^1 + (9 - qo)^{qo, 1 , 1 ) + ~ ^0)^)^ 

= (1 - —)(^>g^)(®,l,l) “ ^ (90,1,1) + 0 {{q-qof) 

\qoJ \ qoJ 2 


We also have 

V^iq) = V5{qo) + {q-qo)^{qo) + 0{{q-qo)‘^) = 14 (^ 0 )- - ■^^{‘^qV5){qo) + 0{{q-qo)^). 

Therefore, we obtain 


^ 2 ( 9 ,1,1) = 


1^5 (go) 


((^g^)(go,i,i))® 


+ - 


{<^qV5){qo) 


1-^ 

go 

s2 


-5 


((^g&)(go,i,i))' 


+ - 


5V5{qo){^ib - <^qb){qo, 1 , 1 ) 


l"4(g) 


2((c^,6)(®,l,l))^ 


((c^,6)(go,l,l))^ 


1 - 


go 


-4 


+ 0( (1-2L 

go 


-3 


an expression which is suitable for singularity analysis. 
Lemma A.6. We have 


[g^ 


1 -^ 

go 


-5 


24 


10 


n 


1 




i + Lr + ^(T2)) 


n 


5n^ 


The previous lemma follows directly by expanding into a binomial series. We can use it to 
extract coefficients of L 2 (g, 1,1) and obtain 


[gT^2(g, 1,1) = go 


U5(go) 


r-n 


+ 


24((cl>,6)(go,l,l))5 
5U5(go) (^gU5)(go) 


(l2((4>g6)(go, 1,1))5 6(($q6)(go, 1,1))® 


+ 


5V'5(go)(^>g^ - ^>gft)(go, 1,1) 
12((c^g6)(go,l,l))6 
1^4(g) 

6(($g6)(go, 1,1))^ 


+ 


+ 0(n^)^. 
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We conclude that the second moment of the total path length is 
^ 5 (®) 4 


24((^>q6)(go, 1, l))^ao(®, 1,1) 
514(^0) 


n 


+ 




(^gV'5)(go) 5V'5 (go)(4>^6 - 4>g6)(go, 1,1) 


12((ch,6)(go,l,l))' 


+ 


VM 


n 


Similarly, writing 

^ 1 ( 9 , 1 , 1 ) = 


Mq) 


+ 


6((4>q6)(go, 1,1))^/ ao{qo, 1,1) 

E2{q) 


+ 0{n^ 


and performing singularity analysis shows that the expectation is 

^ 3 ( 90 ) 


+ 0((1-6(g,l,l)) ) 


2((^g^)(®,l,l))^ao(go,l,l) 

3-^3 (q'o) 


n 


+ 


2((c^,6)(go,l,l))2 ((chg6)(go,l,l))2 


(4>g£;3)(go) ?,E^{qQ){^lb - ^qh){qo, 1,1) 


+ 


2((c^,6)(go,l,l))3 

E2{q) 


($g6)(go,l,l)y ao(®,l,l) 


n 


+ 0 ( 1 ). 


From the results above an expression for the constant that occurs in the asymptotic 
formula for the variance follows. Using Lemma A.5 to rewrite the nested S-functions gives the 
result that was stated in Lemma 17.41 
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